Abstract. The power spectrum of fluctuations in the cosmic microwave background 
(CMB) depends on most of the key cosmological parameters. Accurate future 
measurements of this power spectrum might therefore allow us to determine h, Ω, 
Ω_b, Λ, n, T/S, etc., with unprecedented accuracy. In these lecture notes, we 
review the various physical processes that generate CMB fluctuations, focusing on how 
changes in the parameters alter the shape of the power spectrum. We also discuss 
foregrounds and real-world data analysis issues and how these affect the accuracy with 
which the parameters can be measured. 



1. Introduction 

These are exciting times for CMB researchers. The first detection of CMB fluctuations 
(other than the kinematic dipole) was announced only three years ago [1], shortly 
before the 1992 Varenna cosmology summer school. Since then, the field has virtually 
exploded with activity, on both the experimental and theoretical fronts. On the 
experimental side, ground- and balloon-based experiments have now produced more 
than a dozen independent detections of fluctuations, over a range of angular scales, 
and many more experiments are currently under way or being planned for the near 
future (see [2-4] for recent reviews). On the theoretical side, great progress has been 
made both in understanding and quantifying the various physical effects that generate 
anisotropies (e.g. [5-10]) and in developing data-analysis techniques that enable us 
to extract cosmological information from real-world CMB data (e.g. [11-19]). It now 
seems plausible that the next generation of CMB experiments, together with these 
theoretical advances, will allow accurate determination of many of the key cosmological 
parameters. 

To appear in Proc. Enrico Fermi, Course CXXXII, Varenna, 1995. 
Available from http://www.mpa-garching.mpg.de/~max/varenna.html (faster from Europe) 
and from http://astro.berkeley.edu/~max/varenna.html (faster from the US). 

Section 2 of these lecture notes is an overview of the main physical processes 
that produce anisotropies in the microwave sky, including foregrounds. Section 3 
gives a more detailed discussion of the so-called primary anisotropies, focusing on how 
changes in cosmological parameters alter the power spectrum in Figure 1. In section 4 we 
discuss how accurately these parameters can be measured given real-world problems such 
as pixel noise, beam dilution, foreground contamination and incomplete sky coverage. 
Finally, Appendix A contains a brief primer on power spectra, spherical harmonics, etc. 

Figure 1. The angular power spectrum of the standard CDM model [20] (Ω = 1, 
Ω_b = 0.06, h = 0.5, n = 1). 

2. Sources of CMB fluctuations: an overview 

We will first give a brief summary of all the main effects, then return to a more detailed 
discussion of the primary anisotropies and their dependence on cosmological parameters 
in the following section. 

There are many different ways in which one may choose to categorize the various 
physical processes that affect the temperature distribution we measure in the microwave 
sky. One common approach (e.g. [21]) is to order them chronologically, according to 
how far back in time they occurred, and this is roughly the approach taken in Table 1. 

Table 1. Sources of temperature fluctuations. 

2.1. Primary fluctuations 

As the CMB photons decouple from the baryons around a redshift z ~ 10³, about 
6000 h⁻¹ Mpc away from us, they take with them three different imprints of the region 
from which they last scatter, corresponding to the peculiar gravitational potential φ, 
the radial peculiar velocity v_r and the density fluctuation δ: 

(i) Photons last scattered in a potential well (φ < 0) will experience a gravitational 
redshift as they climb out of it. 

(ii) Photons last scattered by matter whose peculiar velocity is away from us (v_r > 0) 
will receive a Doppler redshift. 

(iii) Photons emanating from an overdense region (δ > 0) will have a higher 
temperature, simply because denser regions are intrinsically hotter. 

These three effects correspond to the first three entries in Table 1, and are summarized 
by the equation 

    \frac{\Delta T}{T}(\hat{\mathbf{r}}) = \phi(\mathbf{r}) - \hat{\mathbf{r}}\cdot\mathbf{v}(\mathbf{r}) + \frac{1}{3}\delta(\mathbf{r}),    (1)

where the length of the vector r is the comoving distance to the surface of last scattering, 
about 6000 h⁻¹ Mpc, and the fields φ, v and δ are to be evaluated at the time of 
recombination, at z ~ 10³. Here and throughout, we use units where the speed of light 
c = 1. As we will discuss further on, adiabatic initial conditions lead to a situation 
where the locations of the overdensities coincide with those of the potential wells, so 
that the third term partially cancels the first. It turns out that for fluctuations on very 
large scales (≫ a few degrees), δ ≈ −2φ, so that these two terms combine to simply φ/3, 
the so-called Sachs-Wolfe effect [22], which is responsible for the flat part on the left 
side in Figure 1. On smaller scales, the fluctuations in φ, v and δ have time to undergo 
a type of acoustic oscillation before recombination occurs, and as we will see further 
on, this is responsible for the so-called Doppler peaks further to the right in Figure 1. 
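
The large-scale limit can be checked in one line. The following is only a sketch, assuming that the potential and density terms enter the temperature fluctuation as φ + δ/3 and that δ ≈ −2φ holds on these scales:

```latex
\left.\frac{\Delta T}{T}\right|_{\text{large scales}}
  \approx \phi + \frac{1}{3}\,\delta
  \approx \phi + \frac{1}{3}\,(-2\phi)
  = \frac{\phi}{3}
```

The overdensity thus cancels two thirds of the gravitational redshift, leaving the net φ/3 of the Sachs-Wolfe effect.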

Since the last-scattering surface (LSS) is not infinitesimally thin, the CMB 
temperature that we measure in a given direction on the sky will in fact be a weighted 
average, corresponding to a mixture of photons coming from the near and far parts of 
the LSS. This effect, listed as the fourth in Table 1 as a "negative source", effectively 
washes out fluctuations on scales below that corresponding to the LSS thickness, which 
corresponds to about ℓ ~ 10³ (θ ~ 0.1°); see Figure 1. This is quite analogous to 
the way in which smoothing with a wide experimental beam washes out small-scale 
fluctuations, the only difference being that the latter smoothing is in the transverse 
rather than in the radial direction. 

We will return to these primary sources of fluctuations in section 3. Alternative 
scenarios based on topological defects are not discussed in these notes (the interested 
reader is referred to [23-31] and references therein), nor will tensor mode fluctuations 
be covered; for C_ℓ formulas for this case, see e.g. [32,33] and references therein. 

2.2. Secondary fluctuations 

The effects listed under this heading in Table 1 roughly speaking refer to processes that 
affect the CMB photons on their way from the last scattering surface to us, i.e., in the 
redshift range 0 < z < 10³. 

2.2.1. Gravitational effects 

After recombination, the baryons have essentially lost their ability to interact with the 
photons through Thomson scattering, but will of course continue to affect the photons 
gravitationally just as they did before. These gravitational effects are conveniently 
divided into four categories, as in Table 1. The first three (the early ISW, late ISW and 
Rees-Sciama effects) are all manifestations of the so-called Integrated Sachs-Wolfe (ISW) 
effect [22], whereby time variation in the gravitational potential imprints temperature 
fluctuations on the CMB. The ISW effect on a single photon is conveniently written as 
an integral along its flight path: 

    \frac{\Delta T}{T} = \int \dot\phi[\mathbf{r}(t), t]\,dt,    (2)

where φ̇ is the conformal time derivative of φ at a fixed position. This has a simple 
physical interpretation. If a photon flies through a potential well, the blueshift it acquires 
when falling in will be exactly canceled by its redshift from climbing out. If the potential 
well becomes more shallow while the photon is in it (φ̇ > 0), the cancellation will no 
longer be perfect, and a net blueshift will result. 

Figure 2. Primary and ISW contributions to the CDM power spectrum. Reprinted 
from [9]. 

Figure 3. The Rees-Sciama effect. Reprinted from [36]. 
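
As a rough numerical illustration of the integral in equation (2), the sketch below integrates a toy φ̇ along a photon's path. The exponentially decaying well, its depth and decay time, and the function names are all illustrative assumptions rather than values from any of the cited calculations; the spatial profile of the well is ignored.

```python
import math

def isw_shift(phi_dot, t_in, t_out, steps=10_000):
    """Midpoint-rule estimate of Delta T / T = integral of phi_dot(t) dt."""
    dt = (t_out - t_in) / steps
    return sum(phi_dot(t_in + (i + 0.5) * dt) for i in range(steps)) * dt

# Toy model (assumed): the potential seen by the photon is
# phi(t) = -A * exp(-t / t_decay), i.e. a well that grows shallower.
A = 1e-5        # hypothetical well depth, in units where c = 1
t_decay = 2.0   # hypothetical decay time, in units of the crossing time

def decaying_phi_dot(t):
    """Time derivative of the toy potential; positive for a decaying well."""
    return (A / t_decay) * math.exp(-t / t_decay)

static_shift = isw_shift(lambda t: 0.0, 0.0, 1.0)       # constant potential
decaying_shift = isw_shift(decaying_phi_dot, 0.0, 1.0)  # decaying potential
exact = A * (1.0 - math.exp(-1.0 / t_decay))            # closed form for the toy model
```

A constant potential gives no net shift, while the decaying well leaves a small positive ΔT/T, in line with the net blueshift described above.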

In the linear regime after matter domination, it is well known that the peculiar 
gravitational potential φ remains constant, in which case there is no ISW effect. There 
are three different cases when φ̇ ≠ 0, and the corresponding contributions to the ISW 
effect are usually called the early ISW, late ISW and Rees-Sciama effects, respectively. 

(i) Shortly after recombination, the photon contribution to the density of the universe 
is still not altogether negligible. As a result, φ decays somewhat, causing the early 
ISW effect. Since the photon density is fixed by the CMB temperature today, 
this effect becomes more important when Ω, the total density, is lowered. 

(ii) If Λ > 0, the universe will eventually become vacuum dominated. If Ω + Λ ≠ 1, 
the universe may become curvature dominated. In both these cases, φ̇ ≠ 0. Since 
vacuum energy and curvature become important only at low redshifts, this is known 
as the late ISW effect. 

(iii) Once non-linear structures such as galaxy clusters form, linear perturbation theory 
of course breaks down, and the perturbation theory result φ̇ = 0 is no longer correct. 
This effect is known as the Rees-Sciama effect [34]. 

The early ISW effect is schematically illustrated in Figure 2, and typically peaks slightly 
to the left of the first Doppler peak. The late ISW, on the other hand, tends to peak on 
the very largest scales, since it becomes effective only at late times [35]. In both cases, the 
effect drops off on scales much smaller than the horizon scale at their formation epoch, 
because of √N-type cancellation effects. The contribution to C_ℓ from the Rees-Sciama 
effect is shown in Figure 3, reprinted from [36]. Although the magnitude of the 
Rees-Sciama effect has been subject to a long debate (e.g. [37-39]), it now seems as though for 
standard CDM models, its impact on the power spectrum is for all practical purposes 
negligible, being several orders of magnitude weaker than the primary anisotropies [36]. 

Figure 4. Angular power spectra including (dotted lines) and excluding (solid lines) 
the effect of gravitational lensing. Reprinted from [40]. 



The ISW effect can be thought of as the photons receiving momentum kicks from 
the gravitational field gradient ∇φ parallel to their flight path. These kicks thus change 
the energy of the photons, but not their direction. There is also a twin effect, caused 
by momentum kicks from ∇φ perpendicular to the flight path. To first order, this effect 
will leave the energy of the photon unchanged, but deflect its trajectory [41]. A recent 
evaluation of this intensely debated effect is shown in Figure 4, which is reprinted from 
[40]. If a pair of photons would have arrived here separated by an angle θ in the absence 
of fluctuations in φ, then they will in reality arrive separated by some angle θ + Δθ. It has 
been shown (see [40] and references therein) that for typical models, Δθ/θ ≪ 1, typically 
between 0.1 and 0.2. This means that this is essentially an issue of weak gravitational 
lensing. If we imagine the CMB fluctuations for φ = 0 painted on the inside of a rubber 
sphere, this lensing effect will correspond to stretching and compressing the rubber in a 
random fashion, much like the way our mirror images get distorted by non-flat mirrors 
in amusement parks. Since Δθ/θ ≪ 1, this distortion will always constitute a one-to-one 
mapping of the image, i.e., there will be no caustics where the image "folds over" 
on itself. There is thus no smearing involved, so the total power in the fluctuations is 
conserved. Rather, as Figure 4 illustrates for two CDM models, the effect of this angular 
jumbling is to smear out the power spectrum somewhat, redistributing power from the 
peaks to the troughs. Although this effect is small (typically a few percent), it may well 
be detectable in future CMB experiments. 

2.2.2. Effects of local reionization 

Above, we discussed the effects of φ on the CMB photons as they free-stream to us 
after recombination. The other two fields, v and δ, can clearly only influence the CMB 
photons if the baryons become reionized. This may happen locally, confined to for 
instance hot clusters of galaxies, or globally, throughout all of space. 

Local reionization manifests itself in two ways, corresponding to the impact of v 
and δ, respectively. Both are known as the Sunyaev-Zeldovich (SZ) effect [42-44]: 

(i) If a cluster of galaxies is moving towards us, Thomson scattering of CMB photons 
off free electrons in the hot intra-cluster gas will cause a Doppler blueshift in the 
direction of the cluster, known as the kinematic SZ effect. 

(ii) Independent of the cluster velocity, the high temperature of the free electrons 
will distort the Planck spectrum by depleting the Rayleigh-Jeans (low ν) tail and 
overpopulating the Wien (high ν) tail. This is known as the thermal SZ effect, 
and appears as a redshift below 218 GHz and as a blueshift above 218 GHz. 

Both of these effects are likely to have a negligible impact on the overall CMB power 
spectrum [45]. However, they can be quite large in the directions of cluster cores, and 
can thus be used to learn more about both the internal properties of clusters (see e.g. 
[46-48]) and cluster peculiar velocities (e.g. [49,50]). 
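
The crossover frequency of the thermal SZ effect quoted above can be checked with a short calculation. The sketch assumes the standard non-relativistic spectral function for the thermodynamic temperature change, f(x) = x coth(x/2) − 4 with x = hν/kT₀, and a present-day CMB temperature T₀ = 2.726 K. Neither is stated explicitly in the text, and the function names are illustrative.

```python
import math

H_PLANCK = 6.62607015e-34   # J s
K_BOLTZMANN = 1.380649e-23  # J / K
T_CMB = 2.726               # K, assumed present-day CMB temperature

def sz_spectral_function(x):
    """Non-relativistic thermal SZ factor f(x) = x * coth(x / 2) - 4."""
    return x / math.tanh(x / 2.0) - 4.0

def find_root(func, lo, hi, tol=1e-12):
    """Plain bisection; assumes func changes sign between lo and hi."""
    f_lo = func(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = func(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Dimensionless frequency where the thermal SZ effect changes sign,
# converted to GHz using x = h nu / (k T_CMB).
x_null = find_root(sz_spectral_function, 1.0, 10.0)
nu_null_ghz = x_null * K_BOLTZMANN * T_CMB / H_PLANCK / 1e9
```

Under these assumptions the null falls near x ≈ 3.83, close to the 218 GHz quoted above; the function is negative (a temperature decrement) below the null and positive above it.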

2.2.3. Effects of global reionization 

If reionization is global, throughout space, the effects on the CMB power spectrum can 
be quite radical, as seen in Figure 5, suppressing fluctuations on small scales. This is a 
simple geometrical effect, illustrated in Figure 6. In the coordinates of this figure, light 
rays propagate along 45° lines just as in Euclidean space. The horizontal circles are 
labeled with their corresponding redshifts, the dotted circle corresponding to the big 
bang. If we (at the apex of the cone) detect a CMB photon arriving from the right in 
the figure, then it would normally have been in the lower right corner at recombination 
at z = 10³. However, if reionization caused the photon to Thomson scatter off free 
electrons, with the last scattering occurring at z = 10, then the photon could have come 
from anywhere in the double-shaded region. In other words, the temperature we see in 
a given direction of the sky is really the weighted average of the temperature of part of 
the z = 1000 last scattering surface. This smearing will be on an angular scale 

    \theta \sim \left(\frac{\Omega}{z}\right)^{1/2},    (3)

corresponding to the angle subtended by the horizon (~ the backward light cone in 
Figure 6) at the redshift z of last scattering. 

Figure 5. The effect of reionization in standard CDM. The curves correspond to 
models where the universe becomes suddenly ionized at the redshifts indicated and 
remains ionized until today. Reprinted from [9]. 



If the universe became reionized at a redshift z and remained ionized until today, 
the probability that a CMB photon would never get scattered is e^{-τ}, where the optical 
depth τ is (e.g. [51]) 

    [Equation (4), giving the optical depth τ as a function of z, is missing from this copy.]    (4)

Thus a fraction e^{-τ} of the photons remain unaffected by the reionization. Since the 
power is the square of the fluctuation amplitude, it will get suppressed by a factor e^{-2τ}. 

For standard CDM parameters, equation (4) means that even if τ ≫ 1, so that 
almost all photons got scattered at some point, the last scattering event for most photons 
will be around z ~ 50, corresponding to a maximum smearing scale θ ~ 8°. 
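
The numbers above can be reproduced with a few lines. The sketch assumes the horizon-angle scaling θ ~ (Ω/z)^{1/2} (in radians) for the smearing scale and takes the optical depth τ as a free input rather than computing it; the function names and the sample value of τ are illustrative.

```python
import math

def smearing_angle_deg(z, omega=1.0):
    """Approximate angle (degrees) subtended by the horizon at redshift z,
    assuming theta ~ sqrt(omega / z) in radians."""
    return math.degrees(math.sqrt(omega / z))

def unscattered_fraction(tau):
    """Fraction of photons that never scatter after reionization."""
    return math.exp(-tau)

def small_scale_power_suppression(tau):
    """Power is the square of the amplitude, so it is damped by exp(-2 tau)."""
    return unscattered_fraction(tau) ** 2

theta_50 = smearing_angle_deg(50.0)                # roughly 8 degrees
suppression = small_scale_power_suppression(0.5)   # tau = 0.5 is a made-up value
```

For Ω = 1 and z = 50 the assumed scaling gives an angle of about 8°, consistent with the maximum smearing scale quoted above.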

In summary, global reionization will affect the power spectrum as follows: 

(i) The power on small scales (ℓ ≫ 10) will be suppressed by a factor e^{-2τ}. 

(ii) The power on larger scales will remain unaffected. 

This is clearly illustrated in Figure 5. The solid curve in the figure reveals an additional 
effect: around the critical scale θ, extreme reionization causes a slight increase in power. 
This corresponds to new "primary" anisotropies from the new surface of last scattering, 
and will exceed the old primary fluctuations because the fluctuations v and δ have grown 
since z = 10³. Finally, the so-called Vishniac effect [9,52-56] is seen to generate new 
fluctuations on very small scales. This is a nonlinear (second-order) effect, related to 
coupling between large-scale fluctuations in v_r and small-scale fluctuations in δ. 

Figure 6. Our backward light cone in a flat Ω = 1 universe is shown in comoving 
coordinates, with conformal time η on the vertical axis. One of the three spatial 
dimensions has been suppressed. 

2.3. "Tertiary" fluctuations (foregrounds and headaches) 

Although the fluctuation sources listed as "tertiary" in Table 1 are of course not CMB 
fluctuations in the conventional sense, reliable parameter estimation from CMB data 
requires accurate knowledge of their properties. 

The frequency dependence of the various foregrounds has been extensively studied, 
in both "clean" and "dirty" regions of the sky (see e.g. [57,58] for recent reviews). 
However, these properties alone provide a description of the foregrounds that is 
somewhat too crude to assess the extent to which they can be separated from the 
underlying CMB signal, since the foreground fluctuations depend on the multipole 
moment ℓ as well (see [48] for simulations). Most published plots comparing different 
CMB experiments tend to show ℓ on the horizontal axis and an amplitude (C_ℓ or an 
r.m.s. ΔT/T) on the vertical axis, whereas most plots comparing different foregrounds 
show amplitude plotted against frequency ν. Since the fluctuations in the latter tend 
to depend strongly on both ℓ and ν, i.e., on both spatial and temporal frequency, one 
obtains a more accurate picture by combining both of these pieces of information and 
working in a two-dimensional plane as in Figure 7. Indeed, this ℓ-ν plane arises 
naturally in a minimum-variance subtraction scheme employing multiple frequencies 
[19]. This figure also shows roughly the regions in this plane probed by various CMB 
experiments. Each rectangle corresponds to one experiment. Its extent in the 
ℓ-direction shows the customary FWHM width of the experimental window function (see 
e.g. [59]), whereas the vertical extent shows the frequency range that is covered. For 
single-channel experiments, we plot the quoted bandwidth, whereas for multi-channel 
experiments, the box has simply been plotted in the range between the lowest and 
highest frequency channel. For a more detailed description of these experiments, see 
[3] and references therein. The various shaded areas correspond to the regions where 
fluctuations from various foreground sources become comparable to those of the CMB, 
and will be discussed in greater detail below. Comparing these with the locations of 
the various boxes, it is easy to understand which experiments are the most affected by 
the various foregrounds. 

Figure 7. Where the various foregrounds dominate. The shaded regions indicate 
where the various foregrounds cause fluctuations exceeding those of COBE-normalized 
scale-invariant fluctuations, thus posing a substantial challenge to estimation of genuine 
CMB fluctuations. They correspond to dust (top), free-free emission (lower left), 
synchrotron radiation (lower left, vertically shaded), radio point sources (lower right) 
and COBRAS/SAMBA instrumental noise and beam dilution (right), all for the 
cleanest 20% of the sky. The heavy dashed line shows the frequency where the total 
foreground contribution to each multipole is minimal. The boxes indicate roughly the 
range of multipoles ℓ and frequencies ν probed by various CMB experiments. Reprinted 
from [19]. 

The foreground estimates given below are all based on [19]. They are not intended 
to be very accurate, especially when it comes to normalization. Rather, the emphasis is 
on their qualitative features, especially those that differentiate them from one another. 
Despite the fact that we currently lack accurate high-resolution data in many important 
frequency bands, we will see that quite robust qualitative conclusions can be drawn 
about which regions of the ℓ-ν plane will be most suitable for estimating various parts 
of the CMB power spectrum. 

2.4. Extragalactic point sources 

In this section, we will make estimates of the angular power spectrum C_ℓ(ν) for point 
sources. Here the ℓ-dependence is well known, but the ν-dependence quite uncertain. 
However, despite these uncertainties, we will see that radio point sources will contribute 
mainly to the lower right corner of Figure 7, whereas infrared point sources will 
contribute mainly to the upper right. 

It is easy to show [19] that, apart from the monopole, the power spectrum of 
unresolved point sources is 

    C_\ell = \int_0^{\infty} \frac{\partial n}{\partial\phi}\,\phi^2\,d\phi,    (5)

where ∂n/∂φ is the differential source count. In other words, we have defined n(φ) as the 
number density per steradian of sources with flux less than φ. Note that although C_ℓ 
will depend on the frequency ν of observation (since φ does), it is independent of ℓ. This 
corresponds to a so-called white noise power spectrum, and will always dominate over 
a "flat" CMB power spectrum at large enough ℓ. 

In real life, we are of course far from powerless against point sources, and can either 
attempt to subtract them by using spectral information from point source catalogues, or 
simply choose to throw away all pixels containing a bright point source. In either case, 
the end result would be that we eliminate all sources with a flux exceeding some flux cut 
φ_c, which then becomes the upper limit of integration in equation (5). Obviously, one 
can never eliminate all point sources, as there are for all practical purposes infinitely 
many of them. 
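
To make the role of the flux cut concrete, the sketch below evaluates the Poisson (white-noise) power of unresolved sources, i.e. the integral of φ² ∂n/∂φ up to the cut φ_c, for a made-up power-law source count. The count normalization, slope and lower flux limit are purely illustrative assumptions, not fits to any catalogue.

```python
def differential_counts(flux, norm=100.0, slope=2.5):
    """Hypothetical dn/dphi = norm * flux**(-slope),
    in sources per steradian per unit flux."""
    return norm * flux ** (-slope)

def point_source_power(flux_cut, flux_min=1e-6, steps=100_000):
    """Midpoint-rule integral of phi^2 * dn/dphi from flux_min to flux_cut.

    The result has no multipole dependence: unresolved sources give a
    white-noise spectrum whose level is set by the flux cut.
    """
    d_flux = (flux_cut - flux_min) / steps
    total = 0.0
    for i in range(steps):
        flux = flux_min + (i + 0.5) * d_flux
        total += flux ** 2 * differential_counts(flux) * d_flux
    return total

power_bright_cut = point_source_power(flux_cut=1.0)
power_faint_cut = point_source_power(flux_cut=0.1)
```

For this assumed slope the residual power scales as the square root of the flux cut, so lowering the cut tenfold reduces the point-source power by only about a factor of three.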

2.4.1. Radio sources 

The frequency dependence of the point source counts is unfortunately very poorly 
known in the microwave region of the spectrum. Based on data from the 1.5 GHz 
VLA FIRST point source catalog, and extrapolating to higher frequencies with a power 
law B(ν) ∝ ν^{-α}, the power spectrum of unresolved radio sources is estimated to be [19] 

    [Equation (6) is not legible in this copy; the surviving fragments show a 4π normalization and a spectral factor containing sinh²(x/2).]    (6)

where x = hν/kT₀. This corresponds to a 100 mJy flux cut at 1.5 GHz (for an all-sky 
survey, this would mean removing about 70000 sources). In Figure 7, the rather cautious 
assumption of an effective spectral index α = 0.0 for the population as a whole has been 
used. Flat-spectrum sources with spectral index α ≈ 0.3 are likely to dominate at higher 
frequencies [61], but this is of course only to be used as a crude first approximation, as 
the emission at higher frequencies is likely to be dominated by sources whose spectra rise 
and peak near those frequencies, and very little is currently known about the abundances 
of such objects. Accurate data at 50 GHz and above are badly needed. 



2.4.2. Infra-red sources 

Our observational knowledge is no better at the high frequency end, where infrared 
emission from high redshift galaxies could play an important role. For instance, if this 
emission is dominated by dust in these galaxies with emissivity β = 2 (see the following 
section), we would expect α = −4 to be a better description at the higher microwave 
frequencies. Unfortunately, the differential source counts of such infrared point sources 
around 100 GHz are still completely unknown. For recent reviews of these issues, see 
[62,63]. 



2.5. Diffuse Galactic sources 

In this section, we discuss the qualitative features we expect for the angular power 
spectra of the diffuse Galactic contaminants, namely dust, free-free emission and 
synchrotron radiation. 



2.5.1. Power spectrum 

The power spectrum of Galactic dust has been estimated by a number of authors 
[64-66,19], using the IRAS all-sky survey [67] and the COBE/FIRAS data. The 100 micron 
IRAS map has an angular resolution of two arcminutes, i.e., better than most proposed 
CMB satellites would attain. Although the amplitude varies greatly with Galactic 
latitude, the overall shape is strikingly independent of latitude, and typically declines 
as C_ℓ ∝ 1/ℓ³ for ℓ between 100 and a few thousand, steepening slightly on the smallest 
scales. 

Direct estimates of the power spectrum of synchrotron radiation are based on the 
Haslam 408 MHz map [68]. Although the angular resolution of this map is only of order 
0.85°, i.e., far too low to provide information for ℓ ≫ 100, the logarithmic slope has been 
found to be consistent with that for dust in the overlapping multipole range, around −3 
[66,19]. These results are hardly surprising, since even without analyzing observational 
data, one may be able to guess the qualitative features of the power spectra of the three 
diffuse components. Since they are all caused by emission from diffuse blobs, one might 
expect their power spectra to exhibit the following characteristic features: 

(i) C_ℓ independent of ℓ for small ℓ, corresponding to scales much greater than the 
coherence length of the blobs (this is the standard Poisson behavior, and follows if 
one assumes that well separated blobs are uncorrelated). 

(ii) C_ℓ falls off at least as fast as 1/ℓ³ for very large ℓ, corresponding to scales much 
smaller than typical blob sizes (this follows from the simple assumption that the 
brightness is a continuous function of position). 

(iii) If ℓ²C_ℓ thus decreases both as ℓ gets small and as ℓ gets large, it must peak at some 
scale, a scale which we refer to as the coherence scale. 

The behavior of the contaminant power spectrum for very small ℓ (whether there is 
indeed a coherence scale, etc.) is of course quite a subtle issue, as the presence of the 
Galactic plane means that the answer will be strongly dependent on which patches of sky 
are masked out during the analysis. These issues, as well as the effect of non-Gaussianity 
and inhomogeneity, are discussed in detail in [19]. In Figure 7, we have simply assumed 
that all three components have a coherence scale of about 10°, corresponding to ℓ ~ 10, 
and used power spectra of the simple form C_ℓ ∝ (5 + ℓ)⁻³. 
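
A quick consistency check of the form adopted for Figure 7: assuming C_ℓ ∝ (5 + ℓ)⁻³, the quantity ℓ²C_ℓ peaks at ℓ = 10, which matches the assumed coherence scale. The sketch below uses an arbitrary normalization, and the function names are illustrative.

```python
def contaminant_power(ell, ell_0=5.0, slope=3.0):
    """Toy diffuse-foreground spectrum C_ell = (ell_0 + ell)**(-slope),
    with arbitrary normalization."""
    return (ell_0 + ell) ** (-slope)

def coherence_multipole(ell_max=1000):
    """Integer multipole at which ell^2 * C_ell is largest."""
    return max(range(1, ell_max + 1),
               key=lambda ell: ell ** 2 * contaminant_power(ell))

ell_peak = coherence_multipole()

# At large ell the spectrum approaches a pure ell**-3 power law,
# so doubling ell lowers the power by nearly a factor of eight.
high_ell_ratio = contaminant_power(2000) / contaminant_power(1000)
```

Setting the derivative of ℓ²(5 + ℓ)⁻³ to zero gives 2(5 + ℓ) = 3ℓ, i.e. ℓ = 10, which is what the brute-force search returns.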



2.5.2. Frequency dependence 

The frequency dependence of the three components has been extensively discussed in 
the literature (see e.g. [69] and references therein). For synchrotron radiation and 
free-free emission, Figure 7 uses simple power laws B(ν) ∝ ν^{-β}. For synchrotron emission, 
β ~ 0.75 below 10 GHz [70], steepening to β ~ 1 above 10 GHz [71], so we simply 
assume β = 1 here. For free-free emission, we make the standard assumption β = 0.1. 
For dust, we assume a spectrum of the standard form 

    B(\nu) \propto \frac{\nu^{3+\beta}}{e^{h\nu/kT} - 1}.    (7)

Although an emissivity index β = 2 is found to be a good fit in the Galactic plane [72], 
we use instead the more conservative parameters T = 20.7 K, β = 1.36, which are found 
to better describe the data at high Galactic latitudes [69], since it is of course the 
cleanest regions of the sky that are the most relevant ones for measurement of CMB 
fluctuations. 
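
The contrast between the three spectral shapes can be sketched numerically. The code assumes a modified-blackbody shape ν^{3+β}/(e^{hν/kT} − 1) for dust with the high-latitude parameters T = 20.7 K and β = 1.36, and pure power laws ν^{-β} for synchrotron and free-free emission; all normalizations are arbitrary and the function names are illustrative.

```python
import math

H_PLANCK = 6.62607015e-34   # J s
K_BOLTZMANN = 1.380649e-23  # J / K

def dust_shape(nu_ghz, temperature=20.7, beta=1.36):
    """Unnormalized modified blackbody: nu**(3 + beta) / (exp(h nu / k T) - 1)."""
    x = H_PLANCK * nu_ghz * 1e9 / (K_BOLTZMANN * temperature)
    return nu_ghz ** (3.0 + beta) / math.expm1(x)

def power_law_shape(nu_ghz, beta):
    """Unnormalized power law nu**(-beta), as assumed for synchrotron and free-free."""
    return nu_ghz ** (-beta)

def log_slope(shape, nu_ghz, step=1.01):
    """Local logarithmic slope d ln(shape) / d ln(nu), by finite difference."""
    return math.log(shape(nu_ghz * step) / shape(nu_ghz)) / math.log(step)

dust_slope_30ghz = log_slope(dust_shape, 30.0)
synchrotron_slope = log_slope(lambda nu: power_law_shape(nu, 1.0), 30.0)
free_free_slope = log_slope(lambda nu: power_law_shape(nu, 0.1), 30.0)
```

At microwave frequencies the dust shape rises steeply with frequency (slope close to 2 + β in the Rayleigh-Jeans regime), while the synchrotron and free-free shapes fall, which is why dust occupies the top of Figure 7 and the other two the lower left.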
2.6. Local sources 

In addition to the above-mentioned sources, all CMB experiments must of course tackle 
problems related to sidelobes from the sun, the moon, Earth and other planets, as 
well as electronic receiver noise. Ground- and balloon-based experiments will inevitably 
receive some atmospheric emission. On top of this, there are of course large numbers of 
potential sources of systematic errors. We will not comment further on most of these 
issues here, as they are often instrument-specific and must be modeled on a case-by-case 
basis. 

The only local fluctuation source we will discuss here is receiver noise. Although 
we usually think of pixel noise as a problem of a different nature than the other 
contaminants, it has been shown [73,19] that it can in fact be described by an angular 
power spectrum C_ℓ(ν) and so be treated on an equal footing. One may ask what the 
point is of doing this, since the statistical impact of the noise is straightforward to 
calculate anyway. The answer is that it provides better physical intuition. Real-world 
brightness data is of course discretely sampled as "pixels" rather than smooth functions 
known at every point r, but as long as the sampling is sufficiently dense (the typical pixel 
separation being a few times smaller than the beamwidth), this discreteness is merely 
a rather irrelevant technical detail. It enters when we do the analysis in practice, but 
our results are virtually the same as if we had continuous sampling. In the common 
approximation that the beam profile is a Gaussian, one finds that the noise power 
spectrum is [73,19] 

    C_\ell^{\rm noise} = \frac{4\pi\sigma^2}{N}\, e^{\theta_b^2\,\ell(\ell+1)},    (8)

where N is the number of pixels in the sky map, σ is the r.m.s. pixel noise, 

    \theta_b = \frac{\rm FWHM}{\sqrt{8\ln 2}},    (9)

and FWHM is the full-width-half-max beam width. The noise power computed in this way for 
the proposed COBRAS/SAMBA satellite [74,19] is plotted in Figure 7, and should be 
fairly representative of the next generation of spaceborne CMB missions. 
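
The noise spectrum can be sketched numerically as follows, assuming the Gaussian-beam form C_ℓ = (4πσ²/N) exp[θ_b² ℓ(ℓ+1)] with θ_b = FWHM/√(8 ln 2). The pixel count, noise level and beam width in the example are made-up values, not the COBRAS/SAMBA specifications, and the function names are illustrative.

```python
import math

def beam_sigma_rad(fwhm_deg):
    """Gaussian beam standard deviation in radians: FWHM / sqrt(8 ln 2)."""
    return math.radians(fwhm_deg) / math.sqrt(8.0 * math.log(2.0))

def noise_power(ell, n_pixels, sigma_pixel, fwhm_deg):
    """C_ell of pixel noise for a Gaussian beam: a white level 4 pi sigma^2 / N,
    multiplied by exp(theta_b^2 ell (ell + 1)) to account for beam dilution."""
    theta_b = beam_sigma_rad(fwhm_deg)
    white_level = 4.0 * math.pi * sigma_pixel ** 2 / n_pixels
    return white_level * math.exp(theta_b ** 2 * ell * (ell + 1))

# Hypothetical experiment: 0.5 degree beam, 1e5 pixels, 1e-5 r.m.s. noise per pixel.
white = 4.0 * math.pi * (1e-5) ** 2 / 1e5
low_ell = noise_power(10, n_pixels=1e5, sigma_pixel=1e-5, fwhm_deg=0.5)
high_ell = noise_power(500, n_pixels=1e5, sigma_pixel=1e-5, fwhm_deg=0.5)
```

Well below the beam scale the noise is essentially white, while near and beyond ℓ ~ 1/θ_b the exponential factor takes over, which is the beam-dilution wall on the right-hand side of Figure 7.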
3. The origin of primary anisotropies 



In this section, we discuss the generation of primary CMB anisotropies in somewhat 
greater detail, with the emphasis on how the power spectrum depends on various 
cosmological parameters. 

Traditionally (e.g. [75-78]), angular power spectra such as that in Figure 1 were 
computed with a "black box" approach: a gargantuan integration of the Boltzmann 
equation (see [79] and references in [9]), including all relevant physical processes, would 
be carried out numerically, and after an overnight run on a workstation, a power 
spectrum would result. Recently, great progress has been made [5,6,9,7,8,10] in providing 
an intuitive understanding of the inner workings of this black box, and it has been 
shown that power spectra accurate to within a few percent can be computed by much 
simpler means [8], by making certain physically motivated approximations. There are 
two distinct phases during which the calculations simplify considerably: 

(i) Before recombination: acoustic oscillations 

As long as the hydrogen is mostly ionized, the mean free path of the CMB photons 
is so short that for all practical purposes, we can treat the photon-baryon plasma 
as a single fluid, which at each point has a well-defined velocity and density. This 
plasma will undergo acoustic oscillations until it recombines, and it has been shown 
[8] that these oscillations can be accurately treated with the WKB approximation. 

(ii) After recombination: curvature and projection effects 

After recombination, things are complicated by the fact that the photons can stream 
freely and no longer have a well-defined velocity at each point. Rather, the evolution 
of their distribution in a 6-dimensional phase space must be computed. Fortunately, 
the fact that they are collisionless makes this quite simple, since we know that 
they will merely follow a null geodesic through spacetime until they reach us. In 
the end, we want the power spectrum as a function of multipole ℓ rather than 
wave number k, and if space is flat, C_ℓ is given by an integral over P(k) that 
has a simple geometrical interpretation. If space is positively or negatively curved 
(Ω + Λ ≠ 1), the geodesics will in addition converge or diverge so that the power 
spectrum in Figure 1 effectively shifts sideways, towards larger or smaller angular 
scales, respectively. 

The remainder of this section is largely based on the Ph.D. Thesis of Wayne Hu [9], to 
which the reader interested in more details is referred (it is available via anonymous ftp 
from pac2.berkeley.edu). Most of these details can also be found in [7,8].

3.1. Before recombination: acoustic oscillations

We will first turn to an old familiar example, that of classical perturbation theory in a
single self-gravitating fluid. We will then present approximate results for the real-world
case, where four components rather than one are present. After illustrating this with a 
simple toy model, we then discuss how the various real-world effects alter this simple 
solution and produce power spectra such as that in Figure 1. 



3.1.1. The classical Jeans analysis 



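The classical equations governing a gas of density $\rho$, velocity $\mathbf{v}$ and pressure $p$ and the
corresponding gravitational potential $\phi$ are the continuity equation, the Euler equation
of motion and the Poisson equation of classical gravity, respectively:

$$
\begin{aligned}
\dot\rho + \nabla\cdot(\rho\mathbf{v}) &= 0,\\
\dot{\mathbf{v}} + (\mathbf{v}\cdot\nabla)\mathbf{v} &= -\nabla\phi - \frac{\nabla p}{\rho},\\
\nabla^2\phi &= 4\pi G\rho.
\end{aligned}
\tag{10}
$$

A simple solution to these equations is

$$
\begin{aligned}
\rho_0(t,\mathbf{r}) &= \rho_0(t) \propto a^{-3},\\
\mathbf{v}_0(t,\mathbf{r}) &= \frac{\dot a}{a}\,\mathbf{r},\\
\phi_0(t,\mathbf{r}) &= \frac{2\pi G\rho_0}{3}\,r^2,
\end{aligned}
$$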



where a is a function of time that satisfies the Friedmann equation, and since the density 
is independent of r, this classical solution clearly corresponds to the unperturbed FRW 
solution in General Relativity. From here on, let us write these fields as functions of the 
comoving position x rather than the physical position r = a(t)x. Let us now expand 
the fields as 

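$$
\begin{cases}
\rho(t,\mathbf{x}) = \rho_0(t)\,[1 + \delta(t,\mathbf{x})],\\
\mathbf{v}(t,\mathbf{x}) = \mathbf{v}_0(t,\mathbf{x}) + \mathbf{v}_1(t,\mathbf{x}),\\
\phi(t,\mathbf{x}) = \phi_0(t,\mathbf{x}) + \phi_1(t,\mathbf{x}),
\end{cases}
$$

and assume that $|\delta|$, $|\mathbf{v}_1|$, $|\phi_1| \ll 1$. Substituting this into equation (10), dropping all
non-linear terms, Fourier transforming everything with respect to $\mathbf{x}$, and doing some
algebra, one obtains the second order ordinary differential equation

$$
\ddot{\hat\delta}(\mathbf{k}) + 2\frac{\dot a}{a}\,\dot{\hat\delta}(\mathbf{k}) + \left(\frac{v_s^2 |\mathbf{k}|^2}{a^2} - 4\pi G\rho_0\right)\hat\delta(\mathbf{k}) = 0, \tag{11}
$$

where hats denote Fourier transforms and $v_s$ is the sound speed. This well-known result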
illustrates two general features: 

(i) If $k < k_c = (4\pi G\rho_0)^{1/2}\,a/v_s$, there will be a solution where the fluctuations grow
without limit, whereas if $k > k_c$, the solutions will correspond to that of a damped
harmonic oscillator.

(ii) The damping rate is given by $\dot a/a$, which we identify as the Hubble constant $H$.

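
The two regimes above can be checked numerically. The following Python sketch integrates equation (11) under assumptions that are not in the text: an Einstein-de Sitter background ($a \propto t^{2/3}$, so that $\dot a/a = 2/(3t)$ and $4\pi G\rho_0 = 2/(3t^2)$), and a ratio $\kappa = k/k_c$ that is held constant in time for simplicity. The function names are hypothetical.

```python
def jeans_accel(t, delta, delta_dot, kappa):
    """Second time derivative of delta according to equation (11).

    Assumes an Einstein-de Sitter background (a ~ t^(2/3)) and a constant
    ratio kappa = k / k_c, both illustrative simplifications.
    """
    hubble = 2.0 / (3.0 * t)                 # adot / a
    four_pi_g_rho = 2.0 / (3.0 * t * t)      # 4 pi G rho_0
    return -2.0 * hubble * delta_dot - four_pi_g_rho * (kappa ** 2 - 1.0) * delta


def evolve(kappa, t0=1.0, t1=100.0, steps=20000):
    """Integrate delta(t) from t0 to t1 with classical RK4; returns all delta values."""
    h = (t1 - t0) / steps
    # Initial conditions chosen on the pressureless growing mode delta ~ t^(2/3).
    t, d, v = t0, 1.0, 2.0 / (3.0 * t0)
    history = [d]
    for _ in range(steps):
        k1d, k1v = v, jeans_accel(t, d, v, kappa)
        k2d = v + 0.5 * h * k1v
        k2v = jeans_accel(t + 0.5 * h, d + 0.5 * h * k1d, k2d, kappa)
        k3d = v + 0.5 * h * k2v
        k3v = jeans_accel(t + 0.5 * h, d + 0.5 * h * k2d, k3d, kappa)
        k4d = v + h * k3v
        k4v = jeans_accel(t + h, d + h * k3d, k4d, kappa)
        d += h * (k1d + 2.0 * k2d + 2.0 * k3d + k4d) / 6.0
        v += h * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
        t += h
        history.append(d)
    return history


if __name__ == "__main__":
    growing = evolve(kappa=0.0)    # k << k_c: unbounded growth, delta ~ a
    damped = evolve(kappa=5.0)     # k > k_c: damped oscillation
    print("k << k_c: final delta =", growing[-1])
    print("k >  k_c: final delta =", damped[-1])
```

With $\kappa = 0$ the numerical solution follows the growing mode $\delta \propto t^{2/3} \propto a$, while for $\kappa$ well above unity it oscillates with a slowly decaying envelope, as expected from features (i) and (ii).
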
3.1.2. The relativistic multifluid case 

In the real world case, we need to consider relativistic effects, and four components
instead of one: CDM, baryons, photons and neutrinos. Collisionless neutrinos of course
cannot be modeled with an equation such as equation (11), since they do not have a
unique velocity at each point. We will ignore them in our simplified discussion here and
refer to [9,80,83] for details, since once the universe has become matter-dominated at
$z \approx 25000\,\Omega_0 h^2$, massless or very light neutrinos will not contribute much to the total
density. As mentioned above, before recombination, the baryons and the photons are
so tightly coupled that we can treat them as one single fluid, thus leaving us with only
two components: CDM and the photon-baryon plasma.

In a typical CDM scenario, the baryons constitute only a small fraction $\Omega_b/\Omega$ of
the density of nonrelativistic matter, so after matter domination, equation (11) will
accurately describe the CDM component if we merely set the pressure to zero ($v_s = 0$).
After solving this equation, we can then take the resulting gravitational potential
as given and obtain a self-contained equation for the evolution of the photon-baryon
plasma. This is done in detail in [9] (§5.2), and the result is

$$
\ddot\Theta + \frac{\dot R}{1+R}\,\dot\Theta + k^2 v_s^2\,\Theta = F, \tag{12}
$$

where dots denote derivatives with respect to the conformal time $\eta \equiv \int (1+z)\,dt$. The
new variable $\Theta$ is simply defined to be a third of the density fluctuation, $\Theta \equiv \delta/3$.
This is more convenient, as this is the actual temperature fluctuation appearing in
equation (1)†. The sound speed is given by

$$
v_s = \frac{c}{\sqrt{3(1+R)}}, \tag{14}
$$

where

$$
R \approx \left(\frac{450}{1+z}\right)\left(\frac{h^2\Omega_b}{0.015}\right) \tag{15}
$$

is, apart from a constant factor, just the baryon-to-photon density ratio. Note that as
$z \to \infty$, $R \to 0$, so that the sound speed approaches the speed of light over $\sqrt{3}$.

† This is the origin of the factor 1/3: Since the photons are tightly coupled to the baryons, and the
latter dominate the density, we have $n_\gamma \propto n_b \propto \rho_b \approx \rho$. Furthermore, blackbody radiation satisfies
$n_\gamma \propto T^3$, so we have $T \propto \rho^{1/3}$. Hence the local temperature fluctuation is

$$
\Theta \equiv \frac{\Delta T}{T} = \frac{1}{3}\frac{\delta\rho}{\rho} = \frac{\delta}{3}. \tag{13}
$$
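
As a rough numerical illustration of how the sound speed drops as the baryons become dynamically important, the following Python sketch evaluates $v_s = c/\sqrt{3(1+R)}$ with $R$ scaling as $h^2\Omega_b/(1+z)$. The prefactor 450 and the redshift $z = 1100$ used for recombination are assumptions of this sketch, and the function names are hypothetical.

```python
import math


def baryon_photon_ratio(z, omega_b_h2=0.015):
    """R as a function of redshift; the prefactor 450 is an assumed value."""
    return 450.0 / (1.0 + z) * (omega_b_h2 / 0.015)


def sound_speed(z, omega_b_h2=0.015):
    """Sound speed of the photon-baryon fluid in units of c."""
    return 1.0 / math.sqrt(3.0 * (1.0 + baryon_photon_ratio(z, omega_b_h2)))


if __name__ == "__main__":
    for z in (1.0e6, 1.0e4, 1100.0):
        print(f"z = {z:9.0f}   R = {baryon_photon_ratio(z):.4f}   v_s/c = {sound_speed(z):.4f}")
```

At very high redshift the result approaches $1/\sqrt{3} \approx 0.577$, and raising `omega_b_h2` lowers the sound speed at any fixed redshift.
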

Comparing equation (12) with equation (11), we notice that the self-gravity of the
photon-baryon plasma (the $4\pi G$-term) has been neglected, with the assumption that
$\Omega_b \ll \Omega$. Instead, the effects of gravity are all incorporated in the source term $F$ in
equation (12). It describes the gravitational impact of the cold dark matter component,
and in a full relativistic treatment [9] it is given by

$$
F = -\ddot\phi_c - \frac{\dot R}{1+R}\,\dot\phi_c - \frac{k^2}{3}\,\phi. \tag{16}
$$

Again, the $\phi$ to be used here is to a first approximation the peculiar gravitational
potential produced by the CDM alone. We see that a second difference between
equation (11) and equation (12) is the damping term, which now vanishes if the sound
speed (and hence $R$) is time-independent. The quantity $\phi_c$ is the perturbation to the
spatial curvature. This general-relativistic quantity has no Newtonian analog. To a first
approximation, $\phi_c \approx -\phi$.



3.1.3. A toy model 

To get a qualitative feeling for the solutions of equation (12), let us first solve a toy 
model based on the following crude approximations: 

(i) The peculiar gravitational potential is time-independent, i.e., $\dot\phi = 0$.

(ii) The sound speed $v_s$ (and hence $R$) is independent of time.

The first approximation is valid in the matter-dominated ($z \ll 25000\,h^2$) linear regime,
but clearly breaks down at very high redshifts. The second approximation is good as long
as $R \ll 1$, i.e., long before recombination, when $v_s \approx c/\sqrt{3}$. With these approximations,
equation (12) reduces to

$$
\ddot\Theta + (v_s k)^2\,\Theta = -\frac{k^2}{3}\,\phi. \tag{17}
$$

We recognize this as the equation of motion for a simple harmonic oscillator with a
time-independent driving force, and the solution is readily found to be

$$
\Theta(\eta) = [\Theta(0) + (1+R)\phi]\cos(v_s k\eta) + \frac{1}{k v_s}\,\dot\Theta(0)\sin(v_s k\eta) - (1+R)\phi. \tag{18}
$$

With the appropriate adiabatic initial conditions [9]

$$
\dot\Theta(0) = 0, \qquad \Theta(0) = -\frac{2}{3}\,\phi, \tag{19}
$$

the solution thus reduces to

$$
\Theta(\eta) = \frac{1}{3}\,\phi\cos(v_s k\eta) - \phi \tag{20}
$$

very early on, when $R \approx 0$. Adiabatic initial conditions thus drive a cosine oscillation. In
contrast, isocurvature initial conditions have $\Theta(0) = 0$ and drive a pure sine oscillation.
We will not discuss the isocurvature case here, but refer to [9] for details.
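
The toy-model solution can be sanity-checked numerically. The Python sketch below (with hypothetical function names, in units where $c = 1$ so that $v_s^2 = 1/[3(1+R)]$) evaluates the driven-oscillator solution for constant $\phi$ and $R$, and uses a finite-difference second derivative to confirm that it satisfies $\ddot\Theta + (v_s k)^2\Theta = -k^2\phi/3$. The parameter values are arbitrary illustrative choices.

```python
import math


def theta_toy(eta, k, phi, R, theta0, theta_dot0):
    """Toy-model solution for constant phi and constant R (units with c = 1)."""
    v_s = 1.0 / math.sqrt(3.0 * (1.0 + R))
    x = v_s * k * eta
    return ((theta0 + (1.0 + R) * phi) * math.cos(x)
            + theta_dot0 / (k * v_s) * math.sin(x)
            - (1.0 + R) * phi)


def oscillator_residual(eta, k, phi, R, theta0, theta_dot0, h=1e-4):
    """Theta'' + (v_s k)^2 Theta + k^2 phi / 3, using a central difference for Theta''."""
    def f(e):
        return theta_toy(e, k, phi, R, theta0, theta_dot0)

    second = (f(eta + h) - 2.0 * f(eta) + f(eta - h)) / (h * h)
    v_s_squared = 1.0 / (3.0 * (1.0 + R))
    return second + v_s_squared * k * k * f(eta) + k * k * phi / 3.0


if __name__ == "__main__":
    # Arbitrary mode in a potential well (phi < 0) with adiabatic initial conditions.
    k, phi = 2.0, -1.0
    print("residual:", oscillator_residual(1.7, k, phi, 0.3, -2.0 * phi / 3.0, 0.0))
    print("Theta(0) + phi:", theta_toy(0.0, k, phi, 0.0, -2.0 * phi / 3.0, 0.0) + phi)
```

The residual should vanish up to finite-difference error, and with adiabatic initial conditions and $R = 0$ the solution reduces to a pure cosine about $-\phi$.
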

3.1.4. The random element 

On average, the random fields $\phi$, $\mathbf{v}$ and $\delta$ are of course all zero. Rather, the relevant
quantities for our purposes are the variances, the power. Thus the contribution from
the first and third terms in equation (1) is

$$
P[\Theta + \phi] = \frac{1}{9}\,P[\phi]\cos^2(v_s k\eta). \tag{21}
$$

As mentioned above, the fields in equation (1) were to be computed on the last-scattering
surface, which means that we are only interested in the solutions at the fixed time
$\eta = \eta_r$, the conformal time corresponding to recombination. Thus although we originally
thought of equation (21) as an oscillating function of $\eta$ with $k$ a mere constant, we now
think of it as a function of $k$ instead, with $\eta$ held fixed. This is why the power spectrum
in Figure 1 exhibits oscillatory behavior with respect to $\ell$.

As is discussed further on, although $\ell \propto k$ in a certain sense, the correspondence is
only approximate, which leads to an additional smearing of the oscillations.

3.1.5. The Doppler peaks vanish...

The "Doppler peaks" we derived above exhibit a qualitatively correct behavior when
compared with Figure 1. If $\phi$ itself has a scale-invariant ("flat") spectrum, the solution
of equation (21) will have a flat part for scales much larger than the recombination
horizon size (about 1°, corresponding to $\eta = \eta_r$), followed by a series of peaks. However,
these peaks in fact have no Doppler in them, since we left out the Doppler term $\hat{\mathbf{r}}\cdot\mathbf{v}$
from equation (1). Let us now remedy this. For the adiabatic case, the peculiar velocity
is given by [9]

$$
|\mathbf{v}| = \frac{3}{k}\,|\dot\Theta| = \frac{1+3R}{\sqrt{3(1+R)}}\,|\phi\sin(v_s k\eta)|. \tag{22}
$$

(The first equal sign follows from the continuity equation and irrotational flow.) Since
the statistical properties of the random field are rotationally invariant, the mean squared
radial velocity is simply $\langle|v_r|^2\rangle = \langle|\mathbf{v}|^2\rangle/3$, so for $R = 0$, equation (22) gives

$$
\langle|v_r|^2\rangle = \frac{1}{9}\,P[\phi]\sin^2(v_s k\eta). \tag{23}
$$

If we approximate the combined contribution from all three terms in equation (1) by
adding equation (21) and equation (23) incoherently, we obtain

$$
P[\Delta] = \frac{1}{9}\,P[\phi]\left[\sin^2(v_s k\eta) + \cos^2(v_s k\eta)\right] = \frac{1}{9}\,P[\phi]. \tag{24}
$$

In other words, when we included the Doppler contribution to the Doppler peaks, they 
vanished! It would thus appear as though we had made negative progress in this section. 

[Figure 8, panel (a): Photons, $c_s = c/\sqrt{3}$.]

Figure 8. The origin of "Doppler peaks": why density and velocity oscillations do
not cancel. Reprinted from [9].



3.1.6. ...and return 

However, this cancellation occurs only if the sound speed $v_s$ is exactly $c/\sqrt{3}$, i.e., if
$R = 0$. This is illustrated in Figure 8, reprinted from [9]. In the right half of this figure,
the solid curves correspond to the time-evolution of the combined density and gravity
term, $\Theta + \phi$, as given by equation (18). The dashed curve shows the time-evolution of
the Doppler term, $v_r$, as given by equation (22). In the upper half, the sound speed is
$v_s = c/\sqrt{3}$, so the solutions reduce to a cosine and a sine of the same amplitude, which
give a mere constant when added in quadrature. When the sound speed is lower (which
happens in reality, since $R$ grows with time according to equation (15)), two changes
occur, both shown in the lower half, and both immediate consequences of equations (18)
and (22):

(i) The amplitude of the dashed curve grows, but the amplitude of the solid curve
grows much faster.

(ii) The solid curve gets shifted upwards, acquiring a non-zero offset.

In other words, when the two curves are added in quadrature, the result will no
longer be constant, but will exhibit bumps: the Doppler peaks have returned. The
left-hand side of Figure 8 is an attempt to provide an intuitive understanding of what
is happening in real space, in a potential well, for a single Fourier mode. The harmonic
oscillator analogy is illustrated by two balls (whose distance from the top of the well
represents $\Theta$, the overdensity in the photon-baryon fluid, and whose velocity represents
the fluid velocity) connected by a spring (representing the pressure attempting to resist
compression). Since we are assuming that $\phi$ is time-independent, the potential well
itself will not change its shape over time, but both the density and the velocity of the
matter in the well will oscillate. The adiabatic initial conditions (19) mean that initially,
there is a slight overdensity inside the well ($\Theta = -(2/3)\phi$), with no fluid velocity at
all ($v \propto \dot\Theta = 0$). This corresponds to the balls being at rest at the dashed circles.
Gradually, more matter starts falling into the well, reaching a maximum velocity when
$\Theta = -\phi$ (the central value around which $\Theta$ is oscillating), until at a maximum density
$\Theta = -(4/3)\phi$, the pressure manages to halt the compression and cause matter to begin
streaming out of the potential well again. This maximum corresponds to the first
"Doppler peak", which is often referred to as a compression peak. Note that this is a
slightly confusing terminology, however, since it makes no sense if one forgets to specify
where the compression occurs. This peak corresponds to maximum compression in the
potential wells and maximum rarefaction at the potential peaks. At this point, $\Theta + \phi
= -\phi/3$ in the upper plot. With the same terminology, the initial conditions ($t = \eta = 0$)
corresponded to a rarefaction peak (maximum rarefaction in the wells, compression at
the peaks), giving the Sachs-Wolfe contribution $\Theta + \phi = \phi/3$.

The lower half of Figure 8 shows the real-world situation, where $v_s < c/\sqrt{3}$.
Since $R > 0$, photons no longer dominate the fluid dynamics completely, so in the
battle between pressure and gravity, the latter gains some territory compared with the
$v_s = c/\sqrt{3}$ case. Compression will thus proceed further until pressure can reverse it
(thus (i), the larger oscillation amplitude). Since the initial conditions are the same,
not just the turning point, but also the zero-point of the oscillations (which is of course
half-way down from the initial value to the turning-point), will shift. The result is (ii),
i.e., that the zero-point of oscillations in $\Theta$ exceeds $\phi$ in magnitude, so that there will
on average be a blueshift from the potential well.

Effects (i) and (ii) are quite strong even for small changes in the sound speed.
Plotting $(\Theta + \phi)^2 + v_r^2$ using equations (18) and (22), lowering $v_s$ by merely 8% is
sufficient to reproduce some features of the power spectrum in Figure 1: the result is flat
for small $k$ and then rises to a Doppler peak about four times the height of the flat
region. However, to make the toy model realistic, we must incorporate the fact that $R$
varies with time, as described below.
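
The disappearance and return of the peaks can be reproduced in a few lines. The Python sketch below evaluates $(\Theta + \phi)^2 + v_r^2$ in units of $\phi^2$ for the toy model with adiabatic initial conditions. It assumes units with $c = 1$, takes the velocity amplitude from $|v| = (3/k)|\dot\Theta|$ with $\langle v_r^2\rangle = \langle v^2\rangle/3$ (this sketch's reading of the velocity relation), and uses hypothetical function names.

```python
import math


def toy_power(x, R):
    """(Theta + phi)^2 + v_r^2 in units of phi^2, at phase x = v_s * k * eta.

    Theta + phi = [(1 + 3R)/3] cos(x) - R follows from the toy-model solution
    with Theta(0) = -(2/3) phi and zero initial velocity.
    """
    density_gravity = (1.0 + 3.0 * R) / 3.0 * math.cos(x) - R
    # |v|^2 = (3/k)^2 |dTheta/deta|^2, with v_s^2 = 1 / (3 (1 + R)).
    v_squared = (1.0 + 3.0 * R) ** 2 / (3.0 * (1.0 + R)) * math.sin(x) ** 2
    return density_gravity ** 2 + v_squared / 3.0


def R_from_sound_speed(fraction):
    """R for which v_s equals `fraction` times its R = 0 value of 1/sqrt(3)."""
    return 1.0 / fraction ** 2 - 1.0


if __name__ == "__main__":
    R_low = R_from_sound_speed(0.92)   # sound speed lowered by 8%
    for x in (0.0, 0.5 * math.pi, math.pi):
        print(f"x = {x:.3f}   R = 0: {toy_power(x, 0.0):.4f}   "
              f"R = {R_low:.3f}: {toy_power(x, R_low):.4f}")
```

For $R = 0$ the sum is independent of phase, whereas for the lowered sound speed the value at the first compression maximum ($x = \pi$) is several times larger than the flat small-$k$ level.
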

We conclude this section by noting that the locations of the peaks correspond to the
maxima and minima of $\phi + \Theta$, the joint contributions of the gravity and density terms
in equation (1), and not to those of the Doppler term $v_r$, whose contribution is zero at
the peaks. In other words, the term "Doppler peaks" is strictly speaking a misnomer.
However, since this term is so firmly entrenched in the CMB vocabulary, we will bow
to convention and use it nonetheless.



3.1.7. The WKB approximation 



Although the above toy model illustrated the mechanism by which the Doppler peaks are 
generated, it is not accurate enough to be useful in practice. One of the recent advances 
in this field was the realization [7,8] that if the WKB approximation is applied to 
equation (12), extremely accurate solutions can be obtained. The WKB approximation, 
often used in, for instance, quantum mechanics, reduces the solution of a second order
linear ordinary differential equation to merely doing integrals, and works when the 
solution oscillates on a time scale much shorter than that on which the coefficients 
change. R changes so slowly that this criterion is generally satisfied in equation (12), 
and the approximate solution is found to be [9]

$$
[1+R(\eta)]^{1/4}\,\Theta(\eta) = \Theta(0)\cos[k r_s(\eta)] + \frac{\sqrt{3}}{k}\left[\dot\Theta(0) + \frac{1}{4}\dot R(0)\Theta(0)\right]\sin[k r_s(\eta)]
+ \frac{\sqrt{3}}{k}\int_0^\eta [1+R(\eta')]^{3/4}\sin[k r_s(\eta) - k r_s(\eta')]\,F(\eta')\,d\eta', \tag{25}
$$

where the sound horizon is defined as

$$
r_s(\eta) \equiv \int_0^\eta v_s(\eta')\,d\eta' = \frac{2}{3}\,\frac{1}{k_{eq}}\sqrt{\frac{6}{R_{eq}}}\,\ln\frac{\sqrt{1+R} + \sqrt{R + R_{eq}}}{1 + \sqrt{R_{eq}}}, \tag{26}
$$

where $R_{eq}$ is $R$ at the time of matter-radiation equality, given by $z_{eq} \approx 24000\,\Omega_0 h^2$,
and $k_{eq}$ is the wavenumber that enters the horizon at that time, given by $k_{eq} \approx
(14\,\mathrm{Mpc})^{-1}\,\Omega_0 h^2$. This approximation turns out to be so accurate that the weakest
link becomes making an accurate model for the driving potential $F$.
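
The closed-form sound horizon can be cross-checked against a direct numerical integration of $v_s\,d\eta$. The Python sketch below does this under assumptions not spelled out in the text: a universe containing only matter and radiation, $R$ proportional to the scale factor, $k_{eq} = a_{eq}H_{eq}$ and $c = 1$. With $y = a/a_{eq}$ these give $k_{eq} r_s = \sqrt{2/3}\int_0^y dy'/\sqrt{(1+y')(1+R_{eq}y')}$. The function names and the sample values of $R$ and $R_{eq}$ are hypothetical.

```python
import math


def sound_horizon_closed(R, R_eq):
    """k_eq * r_s from the closed-form logarithmic expression."""
    log_term = math.log((math.sqrt(1.0 + R) + math.sqrt(R + R_eq))
                        / (1.0 + math.sqrt(R_eq)))
    return (2.0 / 3.0) * math.sqrt(6.0 / R_eq) * log_term


def sound_horizon_numeric(R, R_eq, n=20000):
    """k_eq * r_s from composite Simpson integration (n must be even)."""
    y_max = R / R_eq          # a / a_eq at the epoch where the ratio equals R
    h = y_max / n

    def integrand(y):
        return 1.0 / math.sqrt((1.0 + y) * (1.0 + R_eq * y))

    total = integrand(0.0) + integrand(y_max)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * integrand(i * h)
    return math.sqrt(2.0 / 3.0) * total * h / 3.0


if __name__ == "__main__":
    R, R_eq = 0.4, 0.02       # illustrative values only
    print("closed form:", sound_horizon_closed(R, R_eq))
    print("numerical:  ", sound_horizon_numeric(R, R_eq))
```

The two numbers should agree to many digits, which confirms that the logarithmic expression is just the integral of the time-dependent sound speed in this simplified background.
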

3.2. During recombination: damping and diffusion

The above treatment was valid before recombination, when we could model the photon-baryon
plasma as a single fluid. The fall-off of the power spectrum to the right in
Figure 1 is due to the fact that recombination is not instantaneous, and is caused by the
following two effects:

(i) While the ionization fraction is so low that the photon mean free path is greater
than the wavelength of a perturbation mode, but not yet so low that the photons
have decoupled altogether, this mode will get suppressed. This is usually referred 
to as photon diffusion or Silk damping [84], and suppresses fluctuations in both 
photons and baryons. 

(ii) When we measure the temperature in a given direction in the sky, we are averaging 
photons that last scattered near the front and near the back of the last scattering 
surface. This projection effect washes out fluctuations on scales smaller than the 
thickness of the last scattering surface. 

3.3. After recombination: curvature and projection effects 

So far, we have been discussing the evolution of 3D Fourier modes (plane waves). What
we observe is of course a 2D temperature distribution on the celestial sphere. The
result of converting the 3D results to a 2D power spectrum (see Appendix A for details
about spherical harmonics) is that a spherical harmonic coefficient $a_{\ell m}$ is a weighted average of
the 3D fluctuations on many different scales. Although for a given multipole $\ell$, the
weights (sometimes referred to as the window function) tend to peak at a characteristic
wavenumber $k \propto \ell$, the window function has a non-negligible width. The effect of this is essentially that the
angular power spectrum will be a smeared out version of the 3D power spectrum, with
all sharp features slightly softened. In addition, the proportionality constant between
$k$ and $\ell$ depends on the curvature of the space, increasing with $\Omega_0$. In other words, if
we decrease $\Omega_0$, the angular power spectrum of Figure 1 will basically shift to the right,
since the same physical scale on the last scattering surface will subtend a smaller angle
in the sky. Although these physical scales of course also depend slightly on $\Omega_0$, and
not all in the same way, the overall scaling is roughly that given by equation (3), i.e.,
$\ell \propto \Omega_0^{-1/2}$.

3.4. Parameter dependence 

Let us conclude our discussion of Doppler peaks by summarizing what we care the
most about: how the details of the power spectrum depend on various cosmological
parameters.

$\Omega_0$: As mentioned above, lowering $\Omega_0$ primarily shifts the power spectrum to the right
due to the curvature effect. In addition, the lowest multipoles get boosted by the
late ISW effect.

$\Omega_b$: As we saw in section 3.1.5, the power spectrum had no peaks if $R = 0$. Since
$R \propto h^2\Omega_b$, increasing the baryon fraction will thus make the peaks higher. In
addition, as was illustrated in Figure 8, this boosts the compression peaks (number
1, 3, etc.) much more than the rarefaction peaks (number 2, 4, etc.). This effect is
illustrated in Figure 9.

$h$: The dependence on the Hubble constant when $h^2\Omega_b$ is held fixed at the nucleosynthesis
value $h^2\Omega_b = 0.015$ is shown in Figure 10. Potential decay due to radiation pressure
inside the horizon during radiation domination boosts the peaks via the $F$-term,
so lowering $h$ will increase this boost by delaying matter-radiation equality. By
changing the expansion rate, this also shifts the peaks somewhat.

$\Lambda$: Increasing the cosmological constant while keeping space flat ($\Omega_0 + \Lambda = 1$) will among
other things boost the lowest multipoles via the late ISW effect.

$n$: Increasing $n$, the tilt of the primordial power spectrum of $\phi$, will of course increase
the slope of the angular power spectrum $C_\ell$ as well, raising the right side relative
to the left side.

$\tau$: Early reionization will, as discussed in section 2.2.3, suppress the power at $\ell \gg 10$.



Some of these effects are illustrated in Figure 2. Here an arrow next to a quantity 
indicates the direction in which the curve with the same line type as the arrow will shift 
if the parameter is increased. 





Figure 10. The effect of changing the Hubble constant. From top to bottom, the
curves correspond to h = 0.3, 0.5 and 0.8, respectively. Reprinted from [9].

4. How accurately can the CMB pin down cosmological parameters?

Above we have discussed the various fluctuation sources listed in Table 1. It was seen
that power spectra such as that in Figure 1 depend on virtually all key cosmological 
parameters, which means that accurate measurement of the power spectrum should 
enable us to make estimates of all these parameters. In this section, we discuss the 
usefulness of the CMB for parameter estimation, and describe how to assess the accuracy 
with which the different parameters can be measured. For additional discussion of these 
issues, see [85,73,86]. 

4.1. The parameter estimation problem 

Let us define $a_{\ell m}$ as the observed multipole coefficients (including both the noise
contribution and cosmic variance). This is a priori a vector of random variables with
zero mean, $\langle a_{\ell m}\rangle = 0$. Let us for the moment ignore the complication of the galactic
plane, and assume that we have complete sky coverage. The covariance matrix will then
be diagonal, and, using equation (8), is given by

$$
\langle a_{\ell m}^*\,a_{\ell' m'}\rangle = \delta_{\ell\ell'}\,\delta_{mm'}\,C_\ell^{tot}, \tag{27}
$$

where

$$
C_\ell^{tot} \equiv C_\ell + \frac{4\pi\sigma^2}{N}\,e^{\theta_b^2\,\ell(\ell+1)}. \tag{28}
$$

Here $C_\ell$ denotes the true CMB power spectrum, which depends on some set of
cosmological parameters that we will denote $\boldsymbol{\Theta} = (\theta_1, \theta_2, \ldots, \theta_n)$. We might for instance
consider an n-parameter model where n = 9 and the parameters are

$$
\boldsymbol{\Theta} = (Q, \Omega_0, \Omega_b h^2, h, n, \Lambda, T/S, \tau); \tag{29}
$$

the quadrupole normalization, the density parameter, the baryon density, the Hubble
parameter, the spectral index, the cosmological constant, the ratio of tensor to scalar
modes at $\ell = 2$ and the optical depth due to reionization, respectively.

Suppose that we have measured $a_{\ell m}$ for all $\ell$ up to $\ell_{max}$. The standard way to
estimate these n parameters simultaneously from the observed multipole coefficients
$a_{\ell m}$ is to compute the maximum likelihood (ML) estimate, the value $\hat{\boldsymbol{\Theta}}$ of the vector
$\boldsymbol{\Theta}$ that maximizes the likelihood function $L(a_{\ell m}; \boldsymbol{\Theta})$. This is of course equivalent to
minimizing the quantity $\mathcal{L} \equiv -2\ln L$. $L$ is simply the probability distribution for the
random vector $a_{\ell m}$ given the model parameters $\boldsymbol{\Theta}$. With the standard assumption that
the random fields are Gaussian, $a_{\ell m}$ is just a vector of independent Gaussian random
variables with variances given by equation (27), and dropping an irrelevant additive
constant, we obtain

$$
\mathcal{L} = \sum_{\ell=2}^{\ell_{max}} (2\ell+1)\left[\ln C_\ell^{tot} + \frac{\hat C_\ell}{C_\ell^{tot}}\right], \tag{30}
$$

where the observed power spectrum $\hat C_\ell$ is defined as

$$
\hat C_\ell \equiv \frac{1}{2\ell+1}\sum_{m=-\ell}^{\ell} |a_{\ell m}|^2. \tag{31}
$$
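
A minimal Python sketch of this estimator follows. It computes the observed power spectrum from a set of multipole coefficients and evaluates $\mathcal{L} = -2\ln L$ for a trial total spectrum, assuming independent Gaussian coefficients and full sky coverage. The function names, the dictionary layout and the sample coefficients are all hypothetical.

```python
import math


def observed_power(alm_by_ell):
    """Observed spectrum: sum over m of |a_lm|^2, divided by 2*ell + 1.

    `alm_by_ell` maps each ell to the list of its 2*ell + 1 coefficients.
    """
    return {ell: sum(abs(a) ** 2 for a in alms) / (2 * ell + 1)
            for ell, alms in alm_by_ell.items()}


def minus_two_log_likelihood(c_hat, c_tot):
    """-2 ln L up to an additive constant, for Gaussian a_lm with variances c_tot[ell]."""
    return sum((2 * ell + 1) * (math.log(c_tot[ell]) + c_hat[ell] / c_tot[ell])
               for ell in c_hat)


if __name__ == "__main__":
    # Made-up quadrupole coefficients, purely for illustration.
    c_hat = observed_power({2: [1.0, -1.0, 2.0, 0.0, 1.0]})
    for trial in (1.0, 1.2, 1.4, 1.6, 1.8):
        print(trial, minus_two_log_likelihood(c_hat, {2: trial}))
```

For a single multipole the minimum lies where the trial variance equals the observed power, which is the one-parameter version of the maximum likelihood estimate discussed above.
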



4.2. The error bars 

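Since $a_{\ell m}$ is a random variable, so is the estimated parameter vector $\hat{\boldsymbol{\Theta}}$. A convenient
measure of the resulting errors is thus the covariance matrix of $\hat{\boldsymbol{\Theta}}$, defined as

$$
M \equiv \langle(\hat{\boldsymbol{\Theta}} - \boldsymbol{\Theta}_0)(\hat{\boldsymbol{\Theta}} - \boldsymbol{\Theta}_0)^t\rangle, \tag{32}
$$

where $\boldsymbol{\Theta}_0 = \langle\hat{\boldsymbol{\Theta}}\rangle$. If the ML-estimates are unbiased, then $\boldsymbol{\Theta}_0$ equals the true parameter
values. If not, the bias is normally corrected for with Monte-Carlo simulations.

Let us Taylor expand $\mathcal{L}$ around the ML-estimate $\hat{\boldsymbol{\Theta}}$. By definition, all first derivatives
$\partial\mathcal{L}/\partial\theta_i$ will vanish at the ML-point, so that the local behavior will be dominated by
the quadratic terms. Since $L = \exp[-\mathcal{L}/2]$, we see that the likelihood function will be
approximately Gaussian near the ML-point. If the error bars are quite small, $L$ usually
drops sharply before third order terms have become important, so that this Gaussian is a
good approximation to $L$ everywhere. The covariance matrix $M$ is then given simply by
the second derivatives at the ML-point, as the inverse of the Hessian matrix: $M = I^{-1}$,
where

$$
I_{ij} \equiv \frac{1}{2}\,\frac{\partial^2\mathcal{L}}{\partial\theta_i\,\partial\theta_j}. \tag{33}
$$

In statistics, $\langle I\rangle$ is known as the Fisher information matrix, and it can be proved that
$\langle I\rangle^{-1}$ in a certain sense gives the smallest error bars that can possibly be extracted from
the data, regardless of whether ML-estimation or some other parameter fitting method
is used [87,88]. A straightforward calculation shows that at the ML-point,

$$
I_{ij} = \sum_{\ell=2}^{\ell_{max}} \frac{2\ell+1}{2}\left[C_\ell + \frac{4\pi\sigma^2}{N}\,e^{\theta_b^2\,\ell(\ell+1)}\right]^{-2}\frac{\partial C_\ell}{\partial\theta_i}\,\frac{\partial C_\ell}{\partial\theta_j}. \tag{34}
$$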

This handy expression, also given by [86], tells us that the crucial functions which
determine the attainable accuracy are the derivatives of the power spectrum with respect
to the various parameters. Examples of such derivatives are shown in Figure 11, and
the reader is encouraged to try to interpret the behavior of these curves in terms of
our discussions of parameter dependence in the previous sections. For instance, $\partial C_\ell/\partial\tau$
is shaped as $-C_\ell$ for $\ell \gg 10$, since earlier reionization suppresses all these multipoles
by the same factor $e^{-2\tau}$. $\partial C_\ell/\partial Q$ of course has the same shape as the power spectrum
itself.
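
The sum over multipoles in the Fisher matrix is simple to evaluate once the power spectrum derivatives are tabulated. The Python sketch below is one way to do so, assuming the weight $(2\ell+1)/2$ divided by the square of the total (signal plus noise) power. The function name and argument layout are hypothetical, and the flat toy spectrum $C_\ell = Q^2$ in the usage example is purely illustrative, not a physical model.

```python
import math


def fisher_matrix(derivs, c_ell, noise_ell, ell_min=2):
    """Fisher matrix from tabulated power spectrum derivatives.

    derivs[i][n] is dC_ell/dtheta_i at ell = ell_min + n; c_ell and noise_ell
    hold the CMB power and the beam-weighted noise power on the same ell grid.
    """
    n_par = len(derivs)
    fisher = [[0.0] * n_par for _ in range(n_par)]
    for n, (c, noise) in enumerate(zip(c_ell, noise_ell)):
        ell = ell_min + n
        weight = (2 * ell + 1) / 2.0 / (c + noise) ** 2
        for i in range(n_par):
            for j in range(n_par):
                fisher[i][j] += weight * derivs[i][n] * derivs[j][n]
    return fisher


if __name__ == "__main__":
    # Toy case: C_ell = Q^2 for ell = 2..10, no noise, single parameter Q = 1.
    n_ell = 9
    fisher = fisher_matrix([[2.0] * n_ell], [1.0] * n_ell, [0.0] * n_ell)
    print("Fisher information on Q:", fisher[0][0])
    print("smallest attainable error on Q:", 1.0 / math.sqrt(fisher[0][0]))
```

In this noise-free toy case the attainable error is set by cosmic variance alone and shrinks as more multipoles are included; adding a noise term reduces the weight of the affected multipoles.
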

4.3. A geometric interpretation

Equation (34) has a simple geometric interpretation. We can think of the n functions
$\partial C_\ell/\partial\theta_i$ as vectors in a Hilbert space of dimension ($\ell_{max} - 1$), and think of $I_{ij}$ as
simply the dot product of the vectors $\partial C_\ell/\partial\theta_i$ and $\partial C_\ell/\partial\theta_j$. This dot product gives
a rather small weight to low $\ell$-values, essentially because there are fewer m-modes there
and correspondingly larger cosmic variance. In addition, the weights are seen to be
exponentially suppressed for large $\ell$, where beam dilution causes the effective pixel noise
to explode. If the parameter dependence of $C_\ell$ was such that all n vectors $\partial C_\ell/\partial\theta_i$ were
orthogonal under this dot product, then $I$ and $M$ would be diagonal, and the errors
in the estimates of the different parameters would be uncorrelated. The more similarly
shaped two curves in Figure 11 are, the harder it will be to break the degeneracy
between the corresponding two parameters. In the extreme case where two curves have
identical shape (are proportional to each other), the matrix $I$ becomes singular and the
resulting error bars on the two parameters become infinite. It is therefore interesting to
diagonalize the matrix $M$ (or equivalently, its inverse $I$). The eigenvectors will tell us
which n parameter combinations can be independently estimated from the CMB data,
and the corresponding eigenvalues tell us how accurately this can be done.
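
The geometric picture can be made concrete with a two-parameter example. The Python sketch below builds a 2x2 Fisher matrix from weighted dot products of two derivative vectors and inverts it to obtain the error bars and their correlation. The vectors and weights are made-up numbers and the function names are hypothetical.

```python
import math


def weighted_dot(u, v, weights):
    """Weighted dot product of two derivative vectors over multipoles."""
    return sum(w * x * y for w, x, y in zip(weights, u, v))


def errors_from_derivatives(u, v, weights):
    """Return (sigma_1, sigma_2, correlation) from a 2x2 Fisher matrix."""
    a = weighted_dot(u, u, weights)
    b = weighted_dot(u, v, weights)
    d = weighted_dot(v, v, weights)
    det = a * d - b * b
    if det <= 0.0:
        raise ValueError("singular Fisher matrix: the two parameters are degenerate")
    cov11, cov22, cov12 = d / det, a / det, -b / det   # inverse of [[a, b], [b, d]]
    return math.sqrt(cov11), math.sqrt(cov22), cov12 / math.sqrt(cov11 * cov22)


if __name__ == "__main__":
    weights = [1.0, 1.0, 1.0]
    print("orthogonal shapes:", errors_from_derivatives([1.0, 0.0, 1.0], [1.0, 0.0, -1.0], weights))
    print("similar shapes:   ", errors_from_derivatives([1.0, 2.0, 3.0], [1.0, 2.0, 3.5], weights))
    try:
        errors_from_derivatives([1.0, 2.0, 3.0], [2.0, 4.0, 6.0], weights)
    except ValueError as err:
        print("proportional shapes:", err)
```

Orthogonal derivative vectors give uncorrelated errors, nearly parallel ones give large and strongly anti-correlated errors, and exactly proportional ones make the matrix singular.
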

It has been pointed out [85] that there will be a considerable parameter degeneracy
if data is only available up to around the first Doppler peak. This is clearly illustrated
in Figure 11: most of the curves lack strong features in this range, so some of them can
be well approximated by linear combinations of the others. If a CMB experiment has
high enough resolution to measure the power out to $\ell \sim 10^3$, however, this degeneracy is
broken, and for the 9-parameter model we discussed, most parameters can be estimated
to an accuracy of a few percent or better assuming typical noise levels. This agrees well
with the results of [86], who compute the attainable error bars on $\Omega_0$, and [73] where
the attainable error bars on $n$ and $T/S$ are estimated. Even the abundance of hot dark
matter can be constrained in this fashion [83].

It should be stressed that the formalism above only tells us what the attainable
accuracy is if the truth is somewhere near the point in parameter space at which we
compute the power derivatives. Figure 11 corresponds to standard COBE-normalized
CDM, i.e., to $\boldsymbol{\Theta} = (20\,\mu\mathrm{K}, 1, 0.015, 0.5, 1, 0, 0, 0)$. The worst scenario imaginable would
probably be extremely early reionization, since $\tau \gg 1$ would eliminate almost all small-scale
power and introduce a severe degeneracy by making all the power derivatives
almost indistinguishable for $\ell \gg 10$, the region where they differ the most in Figure 11.

Needless to say, accurate parameter estimation also requires that we can compute 
the theoretical power spectrum accurately. It has been argued [80] that this can be done 
to better than 1%, by accurately modeling various weak physical effects such as Helium 
recombination and polarization. 

4.4. Real-world complications: foregrounds and incomplete sky coverage 

Above we ignored two ubiquitous real-world problems: foregrounds will always be
present at some level, and all-sky data will not be available (because of the hopeless 
foreground problems in the galactic plane). 

4.4.1. Foregrounds 

From the structure of equation (34), the impact of residual foreground contamination (or
the presence of any other fluctuation source of known amplitude) is clear: the foreground
power spectrum simply gets added to the expression in square brackets. For instance, strong
foreground fluctuations around $\ell = 300$ would reduce the weight given to those $\ell$-values
in the dot product. Thus even if the various curves in Figure 11 look quite different
there, these differences would carry little statistical weight, degeneracy problems would
increase and error bars on the parameter estimates would grow.

Estimates of residual foreground contamination are shown in Figure 12. They are
based on the foreground models of section 2.3 and the specifications of the proposed
9-channel COBRAS/SAMBA satellite, and are further described in [19]. The top curve
(shaded) is a standard CDM power spectrum. From top to bottom, the remaining curves
show the residuals obtained with four different subtraction schemes, of increasing levels
of sophistication. The uppermost one corresponds to no subtraction whatsoever,
using observations in merely a single frequency channel, 143 GHz. The second one also
involves no subtraction, but uses the channel that has the smallest total foreground
signal at each multipole. The optimal frequency as a function of $\ell$ is shown by the heavy
solid line in Figure 7, and is seen to increase with $\ell$, reflecting the geography of
the $\ell$-$\nu$ plane. The third curve (dashed) is based on the multi-frequency subtraction
scheme of [19], applied on a pixel-by-pixel basis. It essentially involves fitting for the
various foreground components using the data at all frequencies, but in addition takes
noise levels into account to minimize the r.m.s. of the residuals. The fourth curve
uses the same method [19], but applied in Fourier space instead of in real space, i.e.,
subtracting mode by mode rather than pixel by pixel. By taking advantage of the
$\ell$-dependence of the foregrounds, this is seen to reduce the residuals by another factor of
10. If our foreground models turn out to be roughly correct, the results are of course
quite encouraging. Since the power is the square of the quantity on the vertical axis,
even the extremely simple-minded "best channel" approach would give $C_\ell$-errors at the
1% level, and it seems quite plausible that subtraction methods will allow us to reduce
foreground contamination to negligible levels all the way out past the first few Doppler
peaks.

Figure 12. The residual foreground contamination with various subtraction schemes.

4.4.2. Incomplete sky coverage

The reason that equation (34) has such a simple form is that the covariance matrix in 
equation (27) is diagonal. This happens only if we have complete sky coverage. Even if a
CMB experiment produces an all-sky CMB map, the presence of "dirty" regions such as 
the Galactic plane, the Large Magellanic Cloud, bright point sources, etc., means that 
we may want to throw away some of the pixels, leaving us with a map with a topology 
reminiscent of a Swiss cheese. Incomplete sky coverage impedes parameter estimation 
in two different ways, related to sample variance and spectral smearing. 

Firstly, the sample variance increases, since we are effectively probing fewer
independent modes. The sample variance essentially grows as [60] $\sqrt{4\pi/\Omega}$, where $\Omega$ is
the solid angle covered by the map, so this effect will basically increase the $C_\ell$-term in
square brackets in equation (34) by this factor.

Secondly, it is well-known that it is impossible to compute the exact coefficients $a_{\ell m}$
using merely part of the sky. Instead of the spherical harmonics, we must expand our
maps in some other set of basis functions, functions that vanish in all the "holes" in
our maps. In contrast to the spherical harmonics, each of these functions will inevitably
probe a range of $\ell$-values, rather than just a single multipole, specified by a window
function. The result will be that the functions in Figure 11 will have their features
smeared out on scales smaller than the window function width, thus reducing the
differences between them and increasing the parameter degeneracy. Because of this, we
clearly want these window functions to be as narrow as possible.

A prescription for how to calculate such basis functions, taking incomplete sky
coverage, pixelization, and position-dependent noise into account, is given in [16], and
it is found that given a patch of sky whose smallest angular dimension is
$\Delta\theta$, each basis function will probe an $\ell$-band of width
$\Delta\ell \approx 60^\circ/\Delta\theta$. For instance, if we restrict our analysis to
a 10° × 10° square, then $\Delta\ell \approx 6$. This is very good news. It means that
the only performance degradation in our parameter estimation will stem from the fact
that we are unable to take advantage of sharp features in the power spectra of width
$\Delta\ell \approx 6$ or smaller. This is essentially no loss at all, since all
mainstream models predict fairly smooth power spectra, without any sharp spikes or
discontinuities.
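
The effect of such a band width can be illustrated with a toy calculation. The sketch
below assumes a top-hat window of the quoted width (the actual window functions of [16]
have a different shape) and two made-up Gaussian features in $\ell$-space, one broad and
one sharp. All names and numbers in it are hypothetical and serve only to show that
smearing over roughly six multipoles leaves a smooth feature almost untouched while
strongly suppressing a sharp spike.

```python
import math


def band_width(smallest_dim_deg):
    """Approximate band width quoted in the text: 60 degrees / delta_theta."""
    if smallest_dim_deg <= 0.0:
        raise ValueError("patch dimension must be positive")
    return 60.0 / smallest_dim_deg


def tophat_smooth(spectrum, width):
    """Average spectrum[l] over a top-hat window spanning roughly `width`
    multipoles centred on l. Near the ends of the list, only the part of the
    window that lies inside the list is used. The top-hat shape is an
    assumption made for simplicity."""
    half = int(round(width / 2.0))
    smoothed = []
    for l in range(len(spectrum)):
        lo = max(0, l - half)
        hi = min(len(spectrum), l + half + 1)
        smoothed.append(sum(spectrum[lo:hi]) / (hi - lo))
    return smoothed


def gaussian_feature(l_max, centre, sigma):
    """Toy feature of unit height centred on multipole `centre` (hypothetical)."""
    return [math.exp(-0.5 * ((l - centre) / sigma) ** 2) for l in range(l_max + 1)]


if __name__ == "__main__":
    width = band_width(10.0)                      # 10 deg x 10 deg patch
    broad = gaussian_feature(500, 220, 80.0)      # smooth, peak-like feature
    sharp = gaussian_feature(500, 220, 1.0)       # narrow spike
    print("broad peak after smearing:", tophat_smooth(broad, width)[220])
    print("sharp peak after smearing:", tophat_smooth(sharp, width)[220])
```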
5. Summary

We have attempted to summarize the many sources of CMB anisotropies, ranging from 
useful primary and secondary sources that probe cosmological parameters to "tertiary" 
foreground sources that are an impediment to parameter estimation. We have also 
described how to assess the accuracy with which these parameters can be measured. 
We summarize the current situation as follows. 

(i) The CMB power spectrum depends sensitively on virtually all key cosmological 
parameters, which means that future CMB experiments have the potential to 
measure these parameters with hitherto unprecedented accuracy. 

(ii) Galactic and extragalactic foregrounds tend to differ from the CMB not only in 
their frequency spectra but also in their power spectra, as summarized in Figure 7. 
We can take advantage of this to subtract them out more accurately. 

(iii) By combining high resolution CMB maps from a wide range of frequencies, it 
appears feasible to reduce residual foreground contamination to negligible levels in 
a substantial fraction of the sky. 

(iv) Incomplete sky coverage is not likely to substantially reduce our ability to constrain 
cosmological parameters. It merely increases the sample variance somewhat and 
eliminates our ability to take advantage of sharp features in the power spectrum 
— and none of the mainstream models predict such sharp features. 

(v) Within a CDM framework, the only scenario that would ruin our ability to measure 
cosmological parameters is extremely early reionization. Fortunately, recent CMB 
data on degree scales appear to rule out this possibility [3]. 

In conclusion, it appears as though CMB workers have reason to view the future with 
mild optimism. 
Appendix A. Cosmic statistics: a primer

In this appendix, we review some of the mathematical underpinnings of CMB modeling:
random fields, power spectra, spherical harmonics, etc.

A.1. Random fields

The mathematical theory of random fields (sometimes known as three-dimensional
stochastic processes) is a very useful tool when analyzing cosmological structure
formation. A random field is simply an infinite-dimensional random variable, such that
each realization of it is a field on some space. In cosmology applications, these fields
tend to be $\delta$, $\mathbf{v}$ or $\phi$, and the space is physical space at some
fixed time $t$ (or in the CMB case, the surface of the celestial sphere). As every
quantum field theorist knows, it is a nightmare to try to define a nice measure on an
infinite-dimensional space, so random fields are defined by specifying the joint
probability distribution of their values at any $n$ points, $n = 1, 2, 3, \ldots$, thus
circumventing the need to define a probability distribution on the space of all fields.
Hence, to define a random field $\delta$, one must specify the 1-point distribution (the
1-dimensional probability distribution of $\delta(\mathbf{x}_1)$ for all
$\mathbf{x}_1$), the 2-point distribution (the 2-dimensional probability distribution of
the vector $[\delta(\mathbf{x}_1), \delta(\mathbf{x}_2)]$ for all $\mathbf{x}_1$ and
$\mathbf{x}_2$), etc. In cosmology, the random fields are always assumed to be
translationally and rotationally invariant, i.e. homogeneous and isotropic. Hence the
1-point distribution is independent of $\mathbf{x}_1$, and the 2-point distribution will
depend only on the scalar quantity $x = |\mathbf{x}_2 - \mathbf{x}_1|$.

A.1.1. Ergodicity

A random field $\delta$ is said to be ergodic if one can use ensemble averaging and
spatial averaging interchangeably. The ensemble average of a random field $\delta$ at a
point, denoted $\langle\delta(\mathbf{x})\rangle$, is simply the expectation value of
the random variable $\delta(\mathbf{x})$. Thus for an ergodic field,

$$\langle\delta(\mathbf{x}_1)\rangle = \lim_{R\to\infty}\left(\frac{4}{3}\pi R^3\right)^{-1}\int_{|\mathbf{x}|<R}\delta^*(\mathbf{x})\,d^3x$$






holds for all points $\mathbf{x}_1$ and for all realizations $\delta^*(\mathbf{x})$
(except for a set of probability measure zero). Ensemble averages are completely
inaccessible to us, since we have only one universe to look at, namely the particular
realization that we happen to live in. So as cosmologists, we are quite happy if we have
ergodicity, since this means that we can measure these elusive ensemble averages by
simply averaging over large volumes instead.



A.1.2. Gaussianity

A random field is said to be Gaussian if all the above-mentioned probability distributions
are Gaussian. This is a very popular assumption in cosmology, partly because, as we
will see, it greatly simplifies matters. A first nice feature of this assumption is that
all homogeneous and isotropic Gaussian random fields are ergodic†. Taking the spatial
average of the definition of $\delta$, for instance, ergodicity implies that

$$\langle\delta(\mathbf{x})\rangle = 0.$$

Let us define the correlation function as

$$\xi(x) \equiv \langle\delta(\mathbf{x}_2)\,\delta(\mathbf{x}_1)\rangle.$$

(Note that because of the homogeneity and isotropy, the correlation function depends
only on $x = |\mathbf{x}_2 - \mathbf{x}_1|$.) Since the $n$-point distribution is
Gaussian, it is defined by its mean vector $\langle\delta(\mathbf{x}_n)\rangle$ (which
is identically zero) and its covariance matrix

$$C_{mn} = \langle\delta(\mathbf{x}_m)\,\delta(\mathbf{x}_n)\rangle = \xi(|\mathbf{x}_m - \mathbf{x}_n|).$$

Thus the Gaussian random field $\delta$ has the extremely useful property that it is
entirely specified by its correlation function.
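
To make this concrete, the following minimal sketch builds the covariance matrix of
the field values at a handful of points from a correlation function and draws one
realization from the corresponding zero-mean Gaussian distribution. The Gaussian-shaped
correlation function, the point positions, and all function names are assumptions
chosen for illustration. The Cholesky factorization is one standard way of sampling a
multivariate Gaussian and is not a method prescribed by the text.

```python
import math
import random


def xi_toy(x, length=1.0):
    """Hypothetical correlation function xi(x) = exp(-x^2 / (2 length^2))."""
    return math.exp(-0.5 * (x / length) ** 2)


def covariance_matrix(points, xi):
    """C_mn = xi(|x_m - x_n|) for points given as (x, y, z) tuples."""
    return [[xi(math.dist(p, q)) for q in points] for p in points]


def cholesky(c):
    """Lower-triangular matrix low with low * low^T = c, assuming that c is
    symmetric and positive definite."""
    n = len(c)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = c[i][j] - sum(low[i][k] * low[j][k] for k in range(j))
            low[i][j] = math.sqrt(s) if i == j else s / low[j][j]
    return low


def draw_field_values(points, xi, rng):
    """One realization of the field at the given points: a zero-mean Gaussian
    vector whose covariance matrix is C_mn."""
    low = cholesky(covariance_matrix(points, xi))
    z = [rng.gauss(0.0, 1.0) for _ in points]  # independent unit Gaussians
    return [sum(low[i][k] * z[k] for k in range(i + 1)) for i in range(len(points))]


if __name__ == "__main__":
    pts = [(float(i), 0.0, 0.0) for i in range(5)]
    print(draw_field_values(pts, xi_toy, random.Random(1)))
```

Nothing beyond the correlation function enters the sampling, which is the property
stated above.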



† Note that this is true only for random fields that live on infinite spaces such as
$\mathbb{R}^n$, and does not hold for fields on compact manifolds such as the sphere.
Thus the field of microwave background anisotropies (defined further on) is not ergodic,
so that even if we could reduce our experimental errors to zero, we could still never
measure any ensemble averages with perfect accuracy. This phenomenon is known as
"cosmic variance". It stems from the fact that in the spatial average above, one cannot
average over an infinite volume (i.e. let $R \to \infty$), since the volume of the
compact manifold (in this case the area of the sphere) is finite.



A.2. The 3D power spectrum

If we Fourier expand $\delta$ as

$$\delta(\mathbf{x}) = \frac{1}{(2\pi)^3}\int e^{i\mathbf{k}\cdot\mathbf{x}}\,\hat\delta(\mathbf{k})\,d^3k,$$

we see that if its Fourier transform

$$\hat\delta(\mathbf{k}) = \int e^{-i\mathbf{k}\cdot\mathbf{x}}\,\delta(\mathbf{x})\,d^3x$$

is a Gaussian random variable for any $\mathbf{k}$, then $\delta$ will automatically be
a Gaussian random field (since a sum of Gaussians is always Gaussian). Cosmologists like
to postulate that the complex numbers $\hat\delta(\mathbf{k})$ have random phases, which
implies that they are all uncorrelated. One postulates that

$$\langle\hat\delta(\mathbf{k})^*\,\hat\delta(\mathbf{k}')\rangle = (2\pi)^3\,\delta(\mathbf{k}-\mathbf{k}')\,P(k),$$

where the function $P(k)$ is called the power spectrum and $\delta(\mathbf{k}-\mathbf{k}')$
is the Dirac delta function (which will hopefully not be confused with the random field
$\delta$). This implies that even if $\hat\delta(\mathbf{k})$ does not have a Gaussian
distribution, the random field $\delta$, being an infinite sum of independent random
variables, will still be Gaussian by the Central Limit Theorem for many well-behaved
power spectra. It is straightforward to show that the power spectrum is simply the
three-dimensional Fourier transform of the correlation function, i.e.

$$P(k) = 4\pi\int_0^\infty \frac{\sin kr}{kr}\,\xi(r)\,r^2\,dr.$$

Note that $P$ depends on $\mathbf{k}$ only through its magnitude $k = |\mathbf{k}|$,
because of the isotropy assumption.
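
The radial integral relating the power spectrum to the correlation function is easy to
evaluate numerically. The sketch below does so with a plain trapezoid rule, under the
assumption that the correlation function is negligible beyond some cutoff `r_max`. The
Gaussian correlation function used as a check is a hypothetical example, chosen because
its three-dimensional Fourier transform, $(2\pi\sigma^2)^{3/2} e^{-k^2\sigma^2/2}$, is
known in closed form.

```python
import math


def power_spectrum(k, xi, r_max, n_steps=20000):
    """Approximate P(k) = 4 pi * int_0^r_max [sin(kr)/(kr)] xi(r) r^2 dr with
    the trapezoid rule. The cutoff r_max stands in for the infinite upper
    limit, so xi(r) is assumed to be negligible for r > r_max."""
    h = r_max / n_steps
    total = 0.0
    for i in range(n_steps + 1):
        r = i * h
        kr = k * r
        sinc = 1.0 if kr == 0.0 else math.sin(kr) / kr
        weight = 0.5 if i in (0, n_steps) else 1.0  # trapezoid end points
        total += weight * sinc * xi(r) * r * r
    return 4.0 * math.pi * total * h


def xi_gauss(r, sigma=1.0):
    """Hypothetical correlation function exp(-r^2 / (2 sigma^2))."""
    return math.exp(-0.5 * (r / sigma) ** 2)


def p_gauss_exact(k, sigma=1.0):
    """Closed-form 3D Fourier transform of xi_gauss, used as a cross-check."""
    return (2.0 * math.pi * sigma ** 2) ** 1.5 * math.exp(-0.5 * (k * sigma) ** 2)


if __name__ == "__main__":
    for k in (0.0, 0.5, 1.0, 2.0):
        print(k, power_spectrum(k, xi_gauss, r_max=12.0), p_gauss_exact(k))
```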



A.3. Working with spherical harmonics

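For functions that live on a sphere, the analogue of a Fourier expansion is an expansion
in spherical harmonics. The spherical harmonics are defined as

$$Y_{\ell m}(\theta,\varphi) = \sqrt{\frac{2\ell+1}{4\pi}\,\frac{(\ell-m)!}{(\ell+m)!}}\;P_\ell^m(\cos\theta)\,e^{im\varphi},$$

where $P_\ell^m$ are the associated Legendre functions, and $\ell$ and $m$ are integers
such that $\ell \ge 0$ and $|m| \le \ell$. They have the symmetry property that

$$Y_{\ell,-m} = (-1)^m\,Y_{\ell m}^*,$$

where * denotes complex conjugation. These functions form a complete orthonormal
set on the unit sphere. The orthonormality means that

$$\int Y_{\ell m}^*(\theta,\varphi)\,Y_{\ell' m'}(\theta,\varphi)\,d\Omega = \delta_{\ell\ell'}\,\delta_{mm'},$$

and the completeness means that we can expand any function $\Delta$ as

$$\Delta(\theta,\varphi) = \sum_{\ell=0}^{\infty}\sum_{m=-\ell}^{\ell} a_{\ell m}\,Y_{\ell m}(\theta,\varphi),$$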

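where

$$a_{\ell m} = \int Y_{\ell m}^*(\theta,\varphi)\,\Delta(\theta,\varphi)\,d\Omega.$$

We use the differential solid angle notation

$$d\Omega = \sin\theta\,d\theta\,d\varphi.$$

Also, we will sometimes replace $\theta$ and $\varphi$ by the unit vector

$$\hat{\mathbf{n}} = (\sin\theta\cos\varphi,\ \sin\theta\sin\varphi,\ \cos\theta)$$

and write things like $Y_{\ell m}(\hat{\mathbf{n}})$ and $\Delta(\hat{\mathbf{n}})$.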

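With this notation, the so-called addition theorem for spherical harmonics states that

$$\sum_{m=-\ell}^{\ell} Y_{\ell m}^*(\hat{\mathbf{n}})\,Y_{\ell m}(\hat{\mathbf{n}}') = \frac{2\ell+1}{4\pi}\,P_\ell(\hat{\mathbf{n}}\cdot\hat{\mathbf{n}}'),$$

where $P_\ell = P_\ell^0$ are the Legendre polynomials.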
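
The addition theorem can be checked numerically with a few lines of code. The sketch
below is a minimal, unoptimized implementation that assumes the normalization of the
spherical harmonics given above together with the Condon-Shortley phase in the
associated Legendre functions. That phase is a convention choice on our part; it drops
out of the sum over $m$. The function names are hypothetical.

```python
import cmath
import math


def assoc_legendre(l, m, x):
    """Associated Legendre function P_l^m(x) for 0 <= m <= l and |x| <= 1,
    including the Condon-Shortley phase (-1)^m (an assumed convention)."""
    s = math.sqrt(max(0.0, 1.0 - x * x))
    p_prev = 1.0
    for k in range(1, m + 1):              # P_m^m = (-1)^m (2m-1)!! (1-x^2)^(m/2)
        p_prev *= -(2 * k - 1) * s
    if l == m:
        return p_prev
    p_curr = x * (2 * m + 1) * p_prev      # P_{m+1}^m
    for n in range(m + 2, l + 1):          # upward recursion in the degree n
        p_prev, p_curr = p_curr, (
            (2 * n - 1) * x * p_curr - (n + m - 1) * p_prev
        ) / (n - m)
    return p_curr


def ylm(l, m, theta, phi):
    """Spherical harmonic Y_lm(theta, phi); negative m is obtained from the
    symmetry property Y_{l,-m} = (-1)^m conj(Y_lm)."""
    am = abs(m)
    norm = math.sqrt(
        (2 * l + 1) / (4.0 * math.pi)
        * math.factorial(l - am) / math.factorial(l + am)
    )
    y = norm * assoc_legendre(l, am, math.cos(theta)) * cmath.exp(1j * am * phi)
    return (-1) ** am * y.conjugate() if m < 0 else y


def unit_vector(theta, phi):
    """Unit vector (sin t cos p, sin t sin p, cos t) for polar angles (t, p)."""
    st = math.sin(theta)
    return (st * math.cos(phi), st * math.sin(phi), math.cos(theta))


def addition_theorem_sides(l, dir1, dir2):
    """Return (lhs, rhs) of the addition theorem for two directions, each
    given as a (theta, phi) pair."""
    lhs = sum(
        ylm(l, m, *dir1).conjugate() * ylm(l, m, *dir2) for m in range(-l, l + 1)
    )
    cos_gamma = sum(a * b for a, b in zip(unit_vector(*dir1), unit_vector(*dir2)))
    rhs = (2 * l + 1) / (4.0 * math.pi) * assoc_legendre(l, 0, cos_gamma)
    return lhs, rhs


if __name__ == "__main__":
    for l in range(5):
        print(l, addition_theorem_sides(l, (0.3, 1.1), (2.0, 4.5)))
```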

For the reader who wants more intuition about spherical harmonics, it is good to
lump together all harmonics with the same $\ell$ value. For $\ell$ = 0, 1, 2, 3 and 4,
these sets are referred to as the monopole, the dipole, the quadrupole, the octupole and
the hexadecapole, respectively. Geometrically, when one expands a function
$f(\hat{\mathbf{n}})$ in spherical harmonics, $\ell = 0$ picks up the constant part,
$\ell = 1$ picks up the remaining linear part, $\ell = 2$ picks up the remaining
quadratic part, $\ell = 3$ picks up the remaining cubic part, etc. In group theory
jargon, the spherical harmonics corresponding to different $\ell$-values are the
irreducible representations of the group of rotations of the sphere. This means that if
one expands a rotated version of the same function $f$, the new spherical harmonic
coefficients $a'_{\ell m}$ will be some linear combination of the old ones $a_{\ell m}$,

$$a'_{\ell m} = \sum_{m'=-\ell}^{\ell} D^\ell_{mm'}\,a_{\ell m'},$$

with coefficients $D^\ell_{mm'}$ that depend only on the rotation, such that different
$\ell$-values live separate lives and never mix. For instance, however one chooses to
rotate a linear function ($\ell = 1$), it will always remain linear and never, say,
pick up quadratic terms.
A.4. Random fields on the sphere

$T(\hat{\mathbf{n}})$, the CBR temperature that we observe in the direction
$\hat{\mathbf{n}}$ in the sky, is modeled as a random field. It is more convenient to
work with the dimensionless version

$$\Delta(\hat{\mathbf{n}}) \equiv \frac{T(\hat{\mathbf{n}}) - \langle T\rangle}{\langle T\rangle},$$

which is often denoted $\Delta T/T$. All the formulas and definitions we gave for
$\delta$ have spherical analogues for $\Delta$, as we will now see. The angular
correlation function is defined as

$$c(\theta) = \langle\Delta(\hat{\mathbf{n}})\,\Delta(\hat{\mathbf{n}}')\rangle,$$

where $\hat{\mathbf{n}}\cdot\hat{\mathbf{n}}' = \cos\theta$, and the right hand side is
independent of the actual directions by the isotropy assumption. It is easy to show that
this implies random phases, i.e., that

$$\langle a_{\ell m}^*\,a_{\ell' m'}\rangle = \delta_{\ell\ell'}\,\delta_{mm'}\,C_\ell$$

for some coefficients $C_\ell$. These coefficients $C_\ell$ constitute the spherical
version of the power spectrum $P(k)$, and are usually referred to as the angular power
spectrum. Just as the spatial correlation function was the Fourier transform of the
spatial power spectrum, the angular correlation function is what might be called a
"Legendre transform" of the angular power spectrum $C_\ell$. Using the addition theorem,
one readily obtains

$$c(\theta) = \frac{1}{4\pi}\sum_{\ell=0}^{\infty}(2\ell+1)\,C_\ell\,P_\ell(\cos\theta).$$
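
This Legendre sum is simple to evaluate once the series is truncated at some maximum
multipole. The following minimal sketch does so using the standard three-term
recursion for the Legendre polynomials. The truncation, the function names, and the
toy spectrum in the usage example (a spectrum falling as $1/[\ell(\ell+1)]$ from the
quadrupole up) are assumptions made for illustration only.

```python
import math


def legendre_all(l_max, x):
    """Return [P_0(x), ..., P_lmax(x)] using the recursion
    l P_l = (2l - 1) x P_{l-1} - (l - 1) P_{l-2}."""
    p = [1.0, x]
    for l in range(2, l_max + 1):
        p.append(((2 * l - 1) * x * p[l - 1] - (l - 1) * p[l - 2]) / l)
    return p[: l_max + 1]


def angular_correlation(theta, c_l):
    """c(theta) = (1 / 4 pi) sum_l (2l + 1) C_l P_l(cos theta), where c_l[l]
    holds C_l and the sum is truncated at l = len(c_l) - 1."""
    p = legendre_all(len(c_l) - 1, math.cos(theta))
    total = sum((2 * l + 1) * c_l[l] * p[l] for l in range(len(c_l)))
    return total / (4.0 * math.pi)


if __name__ == "__main__":
    # Hypothetical toy spectrum: no monopole or dipole, 1/[l(l+1)] above that.
    toy = [0.0, 0.0] + [1.0 / (l * (l + 1)) for l in range(2, 101)]
    for deg in (0.0, 10.0, 60.0, 180.0):
        print(deg, angular_correlation(math.radians(deg), toy))
```

Since $P_\ell(1) = 1$, the value at zero separation reduces to
$\sum_\ell (2\ell+1) C_\ell / 4\pi$, which provides a quick sanity check on any
implementation.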



